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SECTION  I 
INTRODUCTION 


Many  spaceborne,  re-entry,  boost  ballistic  and  aircraft  digital  proc- 
essors are  required  to  operate  in  a radiation  environment.  The  use  of 
microprocessors  as  preprocessors  in  such  applications  could  result  in  an 
increase  in  performance  and  flexibility  through  the  use  of  standard,  high 
density  Large  Scale  Integration  (LSI)  computing  elements.  However,  the 
application  of  presently  available  microprocessors  is  restricted  to  unhard- 
ened systems.  The  solution  to  this  dilemma  was  the  subject  of  this  study 
which  was  broken  down  into  three  areas:  the  study  of  several  applications  to 
determine  the  preprocessor  computing  requirements;  the  study  of  the  elec- 
trical performance  and  its  degradation  in  radiation  of  several  device  tech- 
nologies; the  study  of  available  microprocessor  characteristics,  perform- 
ance, and  architecture.  The  results  of  these  studies  are  recommendations 
for  the  technology  and  architecture  of  a microprocessor  suitable  for  use  as 
a preprocessor  in  a radiation  hardened  application. 

The  major  sections  of  this  report  address  the  three  study  areas  men- 
tioned previously.  Section  n describes  the  computing  requirements  which 
were  determined  for  three  applications:  Kalman  filtering,  radar  beamsteer- 
ing and  the  user  equipment  computer  for  the  Global  Positioning  System. 
Section  III  describes  the  electrical  and  radiation  performance  of  the  TTL, 
ECL,  IIL,  and  MOS  device  technologies.  Several  examples  of  the  industry- 
wide hybrid  circuit  capability  are  also  included.  The  state-of-the-art  in 
microprocessor  technology  is  reviewed  in  Section  IV.  Tradeoff  areas  in 
computer  architecture  are  also  discussed.  The  available  bit  slice  devices 
are  compared,  and  examples  of  preprocessor  architecture  are  given. 

Section  V contains  a summary  and  the  conclusions  of  the  study. 


SECTION  II 

APPLICATION  STUDIES 


A.  NAVIGATION  PREPROCESSING 

Military  navigation  systems  have  different  configurations  depending  on 
their  use  and  sensor  complement.  Such  factors  as  manned  or  unmanned 
mission,  aircraft,  space  or  missile  application,  tend  to  influence  the  types 
of  sensors  used,  amount  of  sensor  redundancy,  precision  requirements, 
form  of  outputs,  etc. 

The  two  basic  navigation  methods  are  dead  reckoning  and  position 
fixing.  Dead  reckoning  involves  extrapolation  of  a known  position  to  some 
future  time,  based  on  the  knowledge  and/or  measurement  of  position  rates, 
e.  g.  , velocity.  Position  fixing  involves  determination  of  position  without 
reference  to  any  previously  derived  position.  In  most  systems,  both  naviga- 
tion methods  are  required,  since  long  term  dead  reckoning  would  lead  to 
excessive  errors  without  correction  fixes.  In  some  cases,  involving  missile 
terminal  phases,  position  fixing  is  either  not  required,  or  the  dead  reckoning 
and  fixing  is  combined  to  provide  closed  loop  terminal  guidance.  The  sensor 
complement  generally  associated  with  dead  reckoning  navigation  consists  of 
a true  air  speed  indicator  a doppler  or  correlation  navigator  and  an  inertial 
measurement  unit.  Typical  position  fixing  sensors,  providing  either  con- 
tinuous or  periodic  fixes,  are  TACAN,  LORAN,  OMEGA,  GPS  (Global 
Positioning  System),  star  trackers  and  mapping  radars. 

In  the  older  systems,  the  vehicle  position,  including  attitude , if  appli- 
cable, was  usually  derived  by  processing  each  sensor's  data  independently, 
and  then  either  combining  the  results  to  reduce  the  system  errors  or  using 
them  independently  to  provide  redundancy  in  the  case  of  sensor  or  processor 
failure. 

New  navigation  systems,  on  the  other  hand,  tend  to  use  multisensor 
processing,  usually  employing  a combinational  form  of  a Kalman  filter.  This 
filter,  normally  implemented  in  a recursive  form,  yields  an  optimum  esti- 
mate of  vehicle  state  (e.g.  position,  velocity,  attitude,  etc. ),  based  on  the 
measurements  from  all  sensors. 

The  usefulness  of  the  Kalman  filter  extends  beyond  the  determination 
of  vehicle  position  coordinates.  Since,  in  the  process  of  execution  of  the 
algorithm,  the  Kalman  filter  continuously  estimates  the  error  variances, 
these  can  be  used,  especially  in  the  multisensor  case,  for  error  correction 
or  sensor  recalibration. 
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In  the  past,  Kalman  filter  algorithms  were  normally  solved  as  a part  of 
the  total  navigation  problem  either  in  a dedicated  Navigation  Computer  or  in 
the  vehicle  Central  Computer.  This  has  led  in  many  cases  to  exorbitant 
software  development  costs.  The  present  trend,  strongly  stimulated  by  the 
advent  of  microprocessors,  is  toward  the  decentralization  of  computers  so 
that  it  is  possible  that  a widespread  function  such  as  a Kalman  filter  will  be 
implemented  in  a stand-alone  processing  module.  For  example,  considering 
the  C-5  navigation  computer  implementation!*  in  Figure  1,  the  functions 
within  the  dashed  lines  constitute  a potential  Kalman  filter  processor.  The 
remaining  functions  consist  of  the  sensor  data  conditioning/transformation 
which  could  be  implemented  within  the  sensor  preprocessors,  and  the  hori- 
zontal navigation  function,  which  would  be  either  stand  alone  or  a part  of  a 
larger  navigation  function.  It  should  be  noted  that  this  form  of  partitioning 
allows  the  use  of  the  Kalman  filter  processor  in  either  direct  or  indirect 
mode,  the  latter  being  used  in  the  C-5  aircraft.  This  approach  eliminates 
the  need  for  any  analog-to-digital  or  digital -to- analog  conversion  and  allows 
the  use  of  standard  data  bus  at  the  Kalman  filter  processor  interface. 

1.  Kalman  Filter  Principles 
a.  Mathematics 


The  use  of  a Kalman  filter  in  a navigation  system  provides  a 
means  of  optimally  estimating  the  vehicle  states,  e.  g.  three  dimensional 
position  components,  velocity  components  etc.  Since  the  computational 
algorithm  continuously  updates  the  variances  of  system  state  errors, 
additional  states  such  as  boresight  errors,  and  gyro  drift  biases  can  also 
be  introduced  for  the  purpose  of  calibration,  compensation  and  alignment. 


The  Kalman  filter  algorithm  provides  the  solution  to  the 
differential  (difference)  equations  describing  the  dynamic  system  of  the  form 
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The  measurements  of  the  system  are  described  by 
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*A11  references  are  listed  in  Appendix  A of  this  document. 


Figure  1.  Functional  Block  Diagram  of  C-5  Kalman  Navigation/Alignment  Mechanism 
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system  state  vector  at  time  (sample)  n 
state  transition  matrix 

vector  of  random  excitation  (referenced  toX^  frame) 
state  measurement  vector  at  time  n 
measurement  transformation  matrix 
measurement  noise 


The  above  matrix  equations  are  generally  solved  in  the  re- 
cursive manner,  in  which  the  previous  estimate  ofX  is  combined  with  a 
measurement  to  arrive  at  a minimum  variance  estimate. 


The  optimum  estimate  X ofX,  given  measurements  Yn,  Yn_j, 
etc.  is  given  by 


where 
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K = gain  matrix 
n A 

Yn  = -H  $ i Xn  i = rn  = residual,  or  a difference  between  the 

new  measurements  and  estimated 
measurements 


The  gain  matrix,  chosen  to  provide  the  minimum  variance  solution  is  com- 
puted from: 
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where, 
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where 


Pn  = covariance  matrix  prior  to  observation  (measurement) 

/\ 

P = estimated  covariance  matrix  after  observation 
n 

Rn  = sensorerror  covariance  matrix 
Qn  = excitation  noise  covariance  matrix 

Software  implementation  of  the  Kalman  filter  involves  the  solution  of  the 
matrix  equations  (3)  through  (6).  A typical  execution  of  these  equations  is 
shown  in  the  flow  diagram  of  Figure  2. 

Table  1 summarizes  the  dimensions  of  Kalman  filter  matrices 
and  resultant  storage  requirements. 

Typical  elements  of  the  state  vector,  Xn,  include  desired 
system  states  and  correlated  system  parameters.  For  example,  the  C-5A 
implementation  included  the  following  16  elements  in  the  state  vector: 

3 velocity  errors 
3 position  errors 
3 platform  altitude  errors 
3 gyro  drift  rate  errors 
1 accelerometer  error 
1 doppler  ground  speed  error 
1 doppler  drift  angle  error 

1 barometric  altitude  reference  error  

The  procedure  used  in  deriving  other  Kalman  filter  matrices 
is  based  on  expressing  the  navigation  system  dynamics,  i.e. , equations  of 
motion,  in  terms  of  these  state  variables  using  linear  equations.  Then,  the 
elements  of  the  transition  matrix,  $n , are  obtained  by  taking  partial  deriva- 
tives of  these  equations  with  respect  to  the  state  variables.  Similarly,  the 
elements  of  the  measurement  matrix,  ITn,  are  obtained  by  taking  partial 
derivatives  of  equations  relating  the  measurements  to  system  states  with  re- 
spect to  the  state  variables. 

The  covariance  matrix,  Pn,  represents  the  variance  and  co- 
variance  values  of  the  system  states.  Frequently  an  assumption  can  be  made 
that  there  is  no  cross-correlation  between  the  state  variables  initially.  In 
that  case,  the  initial  value  of  this  matrix,  P0,  will  have  only  diagonal  terms, 
each  representing  an  expected  variance  of  the  respective  state.  Similarly, 
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Figure  2.  Kalman  Filter  Program  Flow 


TABLE  1.  KALMAN  FILTER  MATRICES 
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in  most  applications,  it  is  adequate  to  assume  no  cross  correlation  between 

different  measurements,  resulting  in  the  diagonal  form  of  the  covariance 

matrix  of  the  measurement  noise,  R . 

’ n 


b.  Basic  Filter  Forms 


There  are  two  basic  methods  of  utilizing  Ka  man  filters  in  navi- 
gation systems:  either  as  a direct  or  an  indirect  filter.  The  choice  has  a 
considerable  impact  on  the  complexity  of  the  resultant  algorithms,  which  in 
turn  affects  the  amount  of  computational  resources  required. 

The  direct  filter  implementation,  shown  in  Figure  3,  involves 
direct  computation  of  vehijle  parameters  (position,  velocity  and  possibly 


8 


* 


f 


A 


altitude)  by  including  them  in  the  filter  states.  This  method  is  commonly  \ 

used  in  space  navigation,  particularly  in  those  systems  which  do  not  utilize  1 

an  inertial  platform.  In  those  systems,  even  though  the  output  states  are 
included  in  the  filter's  state  vector,  the  measurements  (e.g.  star  tracker 
sightings)  are  usually  far  apart  so  that  the  measurement  errors  may  be  con- 
sidered uncorrelated  and  no  additional  state  variables  need  be  considered. 

Thus,  in  a typical  space  system,  the  state  vector  has  six  parameters  (three 
position  and  three  velocity  components).  On  the  other  hand,  a direct  ilter 
implementation  for  aircraft  navigation  yields  a very  complex  solution  since 
additional  states  must  be  included  due  to  high  frequency  correlated  errors 
such  as  accelerometer  bias  and  gyro  drift  rates.  In  addition,  filtei  solution 
rates  are  considerably  higher  since  the  smoothed  vehicle  position  computa- 
tions must  follow  the  high  frequency  components  of  the  vehicle  motion. 


To  circumvent  these  computational  problems,  aircraft  systems 
usually  employ  an  indirect  filter  implementation,  shown  in  Figure  4.  In  this 
case  the  Kalman  filter  estimates  only  the  error  states  of  the  inertial  naviga- 
tion system,  rather  than  the  states  of  the  vehicle  directly.  Since  the  inertial 
platform  has  excellent  short  term  accuracy,  filter  sampling  rates  are  de- 
termined by  the  navigation  augumentation  sensors  such  as  LORAN,  OMEGA 
GPS,  etc. , which  may  range  from  seconds  to  minutes.  Thus  while  the  filters 
for  airborne  systems  are  characterized  by  a large  number  of  states  (usually 
of  the  order  of  10  or  more),  the  solution  rates  are  sufficiently  low  to  allow 
for  low  performance  computer  implementation. 

2.  Computer  Requirements 

a.  Data  Rates  and  Precision 

The  Kalman  filter  processing  cycle  depends  on  whether  the 
filter  is  used  directly  or  indirectly  and  on  the  type  of  reference  sensors  used. 
Table  2 lists  typical  iteration  intervals  for  various  sensor  types  assuming  an 
indirect  filter  mode  in  which  the  filter  operates  on  longer  term  system  state 
errors  rather  than  following  short  term  vehicle  dynamics.  Based  on  that 
data,  one  could  assume  a typical  5 s iteration  cycle  for  establishing  proc- 
essing requirements. 

Word  precision  is  directly  affected  by  the  system  accuracy  re- 
quirements. Thus  21-bit  precision  is  required  to  represent  a system  with  1 s 
of  arc  altitude  accuracy,  neglecting  the  truncation  and  round-off  errors. 
Internal  processor  precision  must  be  higher,  considering  the  covariance 
matrix  computations  involving  the  square  of  the  errors.  For  example,  in  the 
case  of  the  C-5  system,  in  order  to  represent  the  range  of  squared  values  of 
platform  tilts  (from  1 s of  arc  to  1°),  24  bits  were  required  not  including 
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Figure  3.  Direct  Kalman  Filter 
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Figure  4. 
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TABLE  2.  KALMAN  FILTER  - ITERATION  INTERVALS 
FOR  POSITION  CORRECTIONS  (FIXES) 

LORAN  /OMEGA  10  s (after  prefiltering) 

GPS  6 s 

Doppler  with  Strapdown  System  1-1.5  s (worst  case) 

Star  Tracker  6-10  minutes 


truncation/round-off  errors.  To  allow  for  the  latter,  in  a single  precision 
processor,  28-30  bits  are  desirable.  Double  precision  and/or  floating  point 
capability  offers  ways  of  getting  around  this  problem  at  a cost  of  hardware 
complexity.  Floating  point  capability  also  avoids  manual  scaling  of  the 
data  which  can  be  quite  tedious  because  of  the  multiplicity  of  matrix  opera- 
tions including  inversion. 

b.  Algorithmic  Content  Analysis 

Examination  of  Kalman  filter  equations  (3)  through  (6)  and 
computational  flow  in  Figure  2 reveals  the  need  for  an  instruction  repertoire 
capable  of  efficient  execution  of  matrix  manipulation  problems.  Maximum 
efficiency  and  speed  are  required  in  execution  of  the  following  basic  in- 
structions: 

Load/Store 

Add/ Subtract 

Multiply 

Conditional  and  Unconditional  Branches 

Indexing  (including  at  least  3 hardwired  indexable  registers) 

Desirable  macro  instructions  or  efficient  subroutines  are  as 

follows: 

Matrix  Addition/Subtraction 

Matrix  Multiplication 

Matrix  Inversion  (e.g.  Gauss-Jordan  algorithm) 

Scalar-Vector  Product 
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c.  Computational  Loading  Analysis 

J.  M.  Mendel 's^  article  provides  a useful  method  for  a quick 
estimate  of  Kalman  filter  computational  loading.  The  results  of  this  analysis 
are  pessimistic  in  a sense  that  all  matrices  and  vectors  are  assumed  to  be 
full  and  that  all  the  computations  are  performed  once  per  iteration.  On  the 
other  hand,  Mendel's  model  does  not  include  precomputation  of  matrix  ele- 
ments and  other  preprocessing  functions  which  may  be  required. 

The  computational  loading  estimates  are  expressed  in  terms  of 
n = number  of  state  variables 
r = number  of  measurements 
s = number  of  system  disturbances 

The  data  is  based  on  programming  a hypothetical  computer  with  instruction 
times  as  shown  in  Table  3.  It  should  be  noted  that  the  instruction  times  are 
expressed  in  terms  of  unit  times  (e.g.  memory  cycles)  except  for  multiply, 
MUL,  and  divide,  DIV,  to  allow  for  their  wide  variations  in  various  computers. 

Mendel  defines  the  total  computation  time,  C(n,r,s,MUL  DIV) 
as  follows:  ’ 

C(n,  r,  s,  MUL,  DIV)  = [M(n,  r,  s,  )•  MUL+2A(n,  r,  s)+L(n,  r,  s,  MUL,  DIV)1TU 


where 

M(n,r,  s)  = total  number  of  multiplications  (See  Figure  5) 

A(n,r,s)  = total  number  of  additions  (See  Figure  6) 

L(n,  r,  s,  MUL,  DIV)  = total  logic  time  (See  Figure  7) 

Tu  = basic  unit  time  (e.g.  1 /is/unit  time) 

For  example,  consider  a typical  aircraft  navigation  problem 
which  has  14  states,  six  measurements  and  one  disturbance.  Then  from 
Figures  5,  6 and  7, 
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TABLE  3.  COMPUTER  MODEL  USED  IN  ANALYSIS 


Instruction 


Execution  Time 
(in  unit  times) 


ADD  TO  A 2 

SUB  FROM  A 2 

LOAD  A OR  B 2 

STORE  A OR  B 2 

MULTIPLY  A WITH  MEMORY  MUL 

DIVIDE  A BY  MEMORY  DIV 

MARK  PLACE  AND  TRANSFER  2 

RETURN  BRANCH  2 

ALL  OTHER  (E.G.  TRANSFERS,  1 

INDEX  REGISTER  INCR. ) 
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M(n,r,s)  = 12  x 103 

A(n,  r,  s)  = 11. 4 x 10^ 

L(n,r,s,  MUL, DIV)  = 11.8  x 104  + 1.6  x 103  + 80  DIV  +35  MUL 

S 11. 96  x 104  + 80  DIV 

and 

C(n,  r,  s,  MUL,  DIV)  = [1.2  x 104  MUL  + 2.28  x 104  + 11.96  x 104 

+ 80  DIVl  T 

u 

Assuming  Tu  = 1 /is/unit  time  (corresponding  to  2 /is  add 
time)  one  obtains  a plot  of  processing  time  as  a function  of  the  multiply  time, 
shown  in  Figure  8.  This  result  shows  that  even  present  day  microprocessors 
which  lack  fast  multiply  capability  can  be  used  for  a wide  range  of  Kalman 
filter  problems. 

In  addition,  processing  loads  can  be  reduced  in  some  systems 
through  the  use  of  such  techniques  as 

(1)  Using  iteration  rates  other  than  covariance  and  filter  gain 
updating  for  measurements  updating. 

(2)  Precomputation  of  filter  gain  based  on  a priori  data, 
d.  Storage  Requirements 

Mendel's  article  can  also  be  used  for  estimating  the  processor 
storage  requirements  for  solving  the  Kalman  filter  problem.  Figure  9 plots 
the  total  (program  and  data)  storage  requirement  as  a function  of  n,  r and  s. 

For  the  preceding  example  of  n = 14,  r = 6 and  s = 1 ? one  ob- 
tains from  Figure  9 the  following  estimate: 

S(n,r,s)  = 1200  + 50  + 489  = 1739  words 

This  estimate  compares  with  220  words  of  storage  allocated  to  the  C-5  Kalman 
filter  problem3  (in  which n = 16,  r = 6).  In  general,  in  addition  to  the  basic 
Kalman  filter  algorithm,  some  additional  preprocessing  or  post  processing 
functions  may  be  included.  Therefore  in  many  systems  up  to  4K  words  of 
memory  are  usually  allotted  for  this  problem. 
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PROCESSING  TIME  - SECONDS 


Figure  8.  Kalman  Filter  Processing  Time  as  a Function 
of  Multiply  Execution  Time 


e.  Summary  of  Microprocessor  Requirements 

Based  on  the  preceding  analysis,  the  desirable  microprocessor 
characteristics  for  the  Kalman  filter  are  shown  in  Table  4.  These  are  based 
on  the  14-state  system,  assuming  an  average  Kalman  filter  processing  cycle  of 
1 to  5 s and  allows  for  30%  processor  utilization  to  accommodate  some  addi- 
tional peripheral  ^motions.  While  the  required  15  to  100  /is  multiply  times 
are  realizable  at  the  upper  range  of  the  scale  with  some  available  bit- slice 
microprocessors  (e.g.  MMI  5701,  Intel  3000  series),  a fast  multiply  add-on 
option  may  be  desirable  for  more  stringent  applications.  It  should  be  noted 
that  a bit- slice  architecture  is  practically  mandatory  for  such  reasons  as 
wide  word  precision,  memory  addressing  and  flexibility. 

B.  RADAR  APPLICATIONS 

The  elements  in  the  signal  processing  chain  in  a multimode  radar  are 
the  antenna,  transmitter/receiver  and  signal  processor.  Associated  with  each 
of  these  elements  are  corresponding  control  units  performing  antenna  control, 
timing  and  mode  control,  and  processor  control.  Possible  radar  applications 
for  microprocessors  exist  as  controllers  and  signal  processors. 

As  controllers,  the  microprocessor  requirements  for  the  radar  appli- 
cation are  similar  to  other  equipments.  Logical  and  conditional  branching 
operations  are  performed  more  frequently  than  straightforward  arithmetic 
computations.  The  controller  throughput  requirements  are  low  since  changes 
in  the  external  environment,  reflected  in  operator  mode  changes,  are  slow  and 
infrequent. 

The  antenna  control  function  was  examined  in  detail  as  a typical  appli- 
cation of  microprocessors  in  radars.  A brief  examination  of  the  signal 
processing  requirements  was  sufficient  to  deter  detailed  study  in  that 
area. 


The  processing  and  control  functions  which  are  required  in  the  antenna 
area  are  shown  in  Figure  10.  The  Scan  Control  Unit  (SCU)  examines  the 
command  inputs  and  determines  the  antenna  pointing  angles,  pattern  and 
polarization.  Frequency  information  from  the  transmitter  exciter  is  passed 
on  to  the  Beamsteering  Computer.  The  Stabilization  Computer  uses  the 
aircraft  attitude  information  to  stabilize  the  pointing  angle  command.  The 
Beamsteering  Computer  determines  the  phase  shift  weights  for  each  antenna 
element.  The  computers  need  not  be  separate  hardware  entities  as  implied 
by  Figure  10.  The  processing  requirements  for  the  Beamsteering  Computer 
are  examined  in  the  following  paragraphs.  The  antenna  is  assumed  to  be  a 
phased  array. 
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TABLE  4.  KALMAN  FILTER  PROCESSOR  REQUIREMENTS 

I i ^ 


Architecture 
Word  Length 

Double  Precision 

Floating  Point 

Instruction  Execution  Times 
Add 

Multiply  (Subroutine) 
Input /Output 

Basic  Instruction  Set 


Desirable  Macros 


Addressing  Modes 


Number  of  index  (or  Indexable) 
Registers 


Bit- Slice 
24  bits  minimum 

30  bits  (without  double  precision) 
Yes 

Desirable 


2 jis 

1 5-100  jus 

1000  words  per  second 
Add/Sub 

Logical,  Compare,  Branches 
Shifts 

Subroutine  call  with  return 
Fast  Multiply  Option  (5-10  jus) 

Matrix  Add,  Multiply 
Trigonometric  (if  sensor  preprocessing, 
and  other  navigation  functions  included) 

Direct  - 4K  words 

Indirect 

Indexed 

3 minimum 


i 

I 


' 

* 

i 


i 
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For  a pencil  beam  and  a row 
computations  are  straightforward, 
axis  phase  tapers  are  found  from 


and  column  antenna,  the  beamsteering 
For  each  beam  position,  the  x and  y 


2 7i  Dx  . . o 

A 4>x  = - — sm  a sin  B 


an 


A0  y = v—  sin  a cos  B = b^ 


where 

A 0x  = adjacent  element  phase  shift  along  x-axis 


A$y  = adjacent  element  phase  shift  along  y-axis 
Dx  = element  spacing  along  x-axis 
Dy  = element  spacing  along  y-axis 
X = radar  wavelengths 
a,  B = pointing  angles 
j,k  = element  index 

The  required  phase  angle  for  each  antenna  element  is  found  from 


<p{ j,k)  = j Acfx  +k  A</>y  = tt  (ja  + kb) 

Typical  problem  parameters  are  shown  in  Table  5.  The  array  consists 
of  a large  number  of  phase  shift  elements.  In  the  worst  case  the  antenn 
position  and  radar  frequency  will  change  on  a pulse-to-pulse  basis  so .that 
the  phase  angle  of  every  antenna  element  would  have  to  be  calculated  duri  g 

the  pulse  repetition  interval. 

A computer  flow  diagram  for  the  phase  angle  computation  is  shown  in 
Figure  11  The  computations  are  simple:  the  phase  tapers 
themput  variables,  and  the  element  phase  angles  are  determined  using  two 
indexed  loops.  The  inner  loop  (k  index)  is  executed  for  each  antenna  element 
and  consists  of  an  increment,  addition,  output  and  t com_ 

sidering  the  other  functions  as  overhead  operation,  the  computer  must  com 
plete  four  operations  for  every  antenna  element  during  every  solution  inter- 
val.  The  required  speed  in  operations  per  second  is 

OPS/SOLUTION  INTERVAL  = _±_  = 12  MOPS 

OPS/S  = SOLUTION  TIME/ELEMENT  PR17N 
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TABLE  5.  TYPICAL  BEAMSTEERING  PARAMETERS 


I 

I 


* 


i 


I 

i 


1 


! 


f 

X 


N 

1500 

Antenna  Elements 

XL 

50 

x-axis  Columns 

\ 

30 

y-axis  Rows 

PRI 

500/iS 

Solution  Interval 

m 

5 

Phase  Shifter  Word  Width 

Dx,  Dy 

X/2 

Interelement  Spacing 

Similarly,  the  output  bit  rate  is 


OUTPUT  RATE  = 


NUMBER  ELEMENTS  (BITS/ELEMENT) 
INTERPULSE  PERIOD 


Nm 

PRI 


15  Mbps 


The  computer  word  size  may  be  determined  by  minimum  beam  movement, 
pointing  accuracy  or  truncation  considerations.  In  the  latter  case,  the  com- 
puter calculations  are  done  with  n bit  words,  while  the  phase  shifter  requires 
m bits.  The  minimum  expressable  angle  (and  the  maximum  error)  in  the 
computer  is  2u/2n.  The  minimum  phase  shifter  angle  is  2'n/2m.  If  the 
phase  shifts,  <£(j,k),  are  computed  by  an  addition  only  algorithm  [<£(j,k)  = 
0(j-l,k)  +7i  a],  then  the  round  off  error  in  the  </(l,  1)  calculation  can  propa- 
gate to  the  ^(X^,  Y^)  element.  The  maximum  error  at  that  element  is 


H = X, 


^l)  + Y = (X  + Y ) ^ZL 

2n  L 2n  L L'  2n 


This  error  should  be  less  than  the  minimum  phase  shifter  angle,  2tt  /2m . 
Therefore, 


V 


(27T) 

2n 
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or 


n-m  > log2  (XL  - Yl) 


with  the  given  parameters  the  minimum  word  length  is 
n > 5 + log2  80  = 5 + 6.32  = 11.32 
n = 12 

The  computer  requirements  for  a single  computer  solution  to  the  beam- 
steering problem  are  summarized  in  Table  6.  Although  simple  computation- 
ally, the  solution  time  per  element  is  quite  low  (0.  33  /is)  and  results  in  very 
high  throughput  and  I/O  rates. 


TABLE  6.  SINGLE  COMPUTER  BEAMSTEERING  REQUIREMENTS 


I/O  Rate 

15  Mbps 

Throughput 

12  MOPS 

Program  Memory 

200  Words 

Temporary  Storage 

8 Registers 

ALU  Width 

12  Bits 

I/O  Devices 

1500 

To  reduce  the  rates  to  more  practical  levels,  assume  that  the  antenna 
array  is  divided  into  S sections  of  N/S  elements.  Assign  one  computer  to 
each  section,  so  that  the  system  consists  of  S computers.  The  increase  in 
hardware  reduces  the  computer  requirements  as  shown  in  Table  7.  For  a 
microprocessor  implementation,  a computer  for  each  array  column  (or  row) 
might  be  a feasible  solution . 

Radar  beamsteering  requires  very  high  throughput  and  I/O  rates  if  the 
radar  requires  pulse-to-pulse  phase  shift  computations  and  the  number  of 
antenna  elements  is  large.  Lower  pulse  repetition  frequencies,  a smaller 
antenna,  or  a fixed  transmitter  frequency  would  reduce  the  computer 
requirements. 
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TABLE  7.  MULTIPLE  COMPUTER  BEAMSTEERING  REQUIREMENTS 


Computer 

Count 

Solution 

Time/Element 

lis 

I/O  Rate 
Mbps 

Throughput 

MOPS 

- - — 

1 

0.33 

15 

12 

3 

1.00 

5 

4 

15 

5.00 

1 

0.75 

30 

10.00 

0.5 

0.40 

45 

15.00 

0.3 

0.27 
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C.  COMMUNICATIONS  APPLICATION 


The  application  studied  was  the  user  equipment  for  the  Global 
Positioning  System  (GPS).  The  purpose  of  the  GPS  is  to  allow  the  user  to 
accurately  determine  his  position  via  satellite  communications.  The  com- 
pleted system  will  consist  of  24  satellites  in  different  orbital  paths  above  the 
earth.  The  user  of  GPS  will  be  able  to  receive  information  from  at  least  four 
satellites  at  any  given  time  and  location  on  earth.  Each  satellite  will  trans- 
mit its  precise  position  at  some  given  time  (altitude,  longitude  and  latitude 
at  tj).  By  knowing  their  times  and  positions,  the  user  can  calculate  his  posi- 
tion at  a given  time  (four  equations  and  four  unknowns).  System  character- 
istics are  given  in  Table  8. 

Because  of  the  size,  weight  and  cost  objectives,  a microprocessor 
may  be  the  optimum  choice  for  a processor.  However,  its  specification  is  not 
trivial.  The  choice  of  a processor  will  greatly  affect  the  size  of  program 
memory  which  is  very  sensitive  to  cost.  Therefore  the  data  accuracies,  speed, 
and  algorithms  must  be  thoroughly  defined. 

In  order  to  express  the  altitude  of  the  satellites  with  an  accuracy  of 
several  feet,  a processor  word  of  at  least  24  bits  is  required.  Allowing 
margin  for  round  off  and  error  propagation,  32 -bit  arithmetic  would  be 
adequate.  This  precision  can  be  met  either  with  a 32-bit  processor  or  with 
a 16-bit  processor  with  double  precision  capability. 
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TABLE  8.  GPS  SPECIFICATIONS 


Total  Number  of  Satellites 

Overall  Pseudo  Noise  Sequence  Period 

Each  Satellite  Pseudo  Noise  Sequence 

Period  (32  unique  sequence  possibilities) 


Information  Frame  Time 

Number  of  Bits/Frame  (Word) 

Transmitting  Frequency 

Maximum  Doppler 

Perigee  Altitude 

Apogee  Altitude 

Period 

Eccentricity 

Duration 

Semimajor  Axis 

Accuracy  (one  SIGMA  prediction  error) 

Acquisition  Time 

Display  Update  Time 

Size 

Weight 

Cost 

Temperature 


24 

267  days 
7 days 

6,  12,  18,  24,  30,  36  seconds 

300,  600,  900,  1200,  1500,  1800 

1.5  GHz 

3 kHz 

20,  000  KM 

20,  000  KM 

12  hours 

0- . 005 

5 years 

26,  500  KM 

18  ft.  maximum 

5 minutes 

1 minute 

18”  x 12”  x 6” 

8-25  lbs. 

$15,000  to  $30,000 
-40  to  +155°F 
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The  functions  to  be  performed  in  the  user  equipment  processor 
are  shown  in  the  flow  diagram  in  Figure  12.  The  user  will  initialize  the 
processor  and  estimate  his  current  position  and  time.  Using  stored  data, 
the  processor  will  determine  the  position  of  all  operational  satellites,  and 
select  a set  of  four  transmitters  based  on  mission  performance  require- 
ments. The  processor  will  provide  data  to  the  receiver  for  satellite  acqui- 
sition. This  information  will  include  estimated  range  and  doppler  and  the 
associated  pseudo  noise  (pn)  code  which  is  unique  to  a given  satellite.  The 
satellite  data  is  encoded  using  a pn  code  so  that  a secure  communications 
link  can  be  achieved.  The  received  data  is  checked  and  corrected  for  atmos- 
phere errors,  GPS  clock  errors,  and  calibrated  receiver  delays.  The 
position  calculation  is  performed  and  the  results  are  filtered.  The  position 
data  is  scaled  to  the  appropriate  units  and  formatted  for  the  output  display 
device.  The  processor  then  continues  to  monitor  all  satellite  positions,  so 
that  new  satellites  can  be  acquired  as  old  ones  move  out  of  range.  Portions 
of  the  software  are  described  in  the  following  paragraphs. 

The  Satellite  Selection  Routine  requires  the  calculation  of  the 
satellite's  positions.  The  orbits  have  very  small  eccentricity,  so  they  can  be 
assumed  to  be  circular.  Assuming  that  the  satellite's  current  period,  T.,  is 
known,  and  that  its  position  at  some  other  time  at  t^)  is  also  known,  1then 
its  present  position,  6 , is 

e,(t)  = p-  (t-t)  +e  l = 1,2 24 

1 Ai  1 1 0 \ 6 < 2 it 

Thus,  in  its  own  planar  coordinate  system,  its  position  (x.,  y.)  is 


x. '  = R.  cos  9 . 

i i i 

y. '  = R.  sin  where  R.  = orbital  radius. 

Ji  i i i 

These  coordinates  can  be  transformed  to  the  user's  inertial  coordinate  system 
(or  latitude,  longitude,  altitude)  through  the  appropriate  direction  cosine 
transform; 


ali 

a2i 

*i 

II 

a3i 

a4i 

z. 

1 . 

_a5i 

a6i_ 

i = 1,2,...  24 


USER  INPUT 


• RESET  SYSTEM 

• INPUT  ESTIMATED  POSITION 


DETERMINE  POSITION  OF  ALL  SATELLITES 
SELECT  TRANSMITTER  SUBSET 


• PROVIDE  ACQUISITION  DATA  TO  RECEIVER 

• INPUT  RECEIVER  DATA 


CHECK, CORRECT  INPUT  DATA 
CALCULATE  POSITION 
FILTER  RESULTS 


SCALE  AND  FORMAT  DATA 
OUTPUT  DATA 


Figure  12.  GPS  Processor  Software 
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or 


r . aiP; 

where  the  aij  are  the  direction  cosine  between  the  plane  of  the  satellite's  orbit 
and  the  user^s  coordinate  system.  This  position  calculation  will  require  stor- 
age of  240  constants  (24  x 10)  and  will  result  in  a position  array  of  72  entries 
(24  x 3). 

Another  portion  of  the  Satellite  Selection  Routine  is  the  selection 
of  a subset  of  four  of  the  satellites  to  provide  the  position  information.  The 
selection  criteria  could  be  based  on  the  four  closest  in  range  or  on  the  princi- 
ple called  Geometric  Dilution  of  Precision  (GDOP).  GDOP  is  a phenomenon 
which  results  when  the  intersecting  surfaces  which  define  the  user's  position 
are  nearly  tangent.  Although  the  intersection  is  a point  mathematically,  the 
real  world  effects  of  finite  precision  result  in  decreased  accuracy. 

A flow  diagram  of  the  Satellite  Selection  software  is  shown  in 
Figure  13.  Rather  than  evaluating  the  selection  criterion  for  each  of  the  24 
satellites,  an  initial  partial  ordering  is  performed  which  restricts  those 
subsequently  examined  to  the  set  in  the  viewable  hemisphere.  The  output  of 
this  procedure  is  a list  of  four  satellites  which  best  satisfy  the  selection 
criterion. 

The  Receiver  Interface  software  examines  the  selection  list  and 
provides  the  receiver  with  the  estimated  range  and  doppler  of  the  four  satel- 
lites. The  processor  also  tells  the  receiver  which  of  32  possible  pn  codes  each 
satellite  is  using. 

After  the  receiver  has  acquired  the  four  satellites,  the  user's 
position  can  be  determined  by  solving  the  four  pseudo- range  equations: 

3 

y <Vu,)2=  (r.-b)2  1=1, 2, 3, 4 

j=l 

th 

= j—  component  of  position  of  the  user 

til  til 

= j—  component  of  position  of  the  i—  satellite 

th 

= pser  j range  between  the  i— satellite  and  user 
= user's  clock  bias  (units  of  range) 


where 


u- 


x. 


i] 
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REDUCE  WORKING  SET 


CALCULATE  ORDERING  CRITERIA 


ORDER  WORKING  SET 


SELECT  BEST  FOUR 


TO  RECEIVER  INTERFACE 


Figure  13.  Satellite  Selection  Software 


With  the  definitions 

u. 


u = 


u. 


u„ 


L oj 


dr  = 


X. 


i=  i 


i] 


the  last  three  pseudo  range  equations  can  be  linearized  by  subtracting  the 
first,  with  the  result 

Au  = br  - ^ v 


where 


A = 


21  “ X11 

X22"X12 

X23"X13 

3rxn 

X32  " X12 

X33"X13 

41 " X11 

X42"X12 

X43~X13 

Thus,  the  components  of  u can  be  expressed  in  terms  of  the  other  unknown, 
b: 


a ~ 

c. 

1 

1 

u=bA1r-^A1v=  ba  + c = b 

a2 

+ 

c2 

1 

s» 

CO 

o 

co 

1 

This  relation  can  be  substituted  in  the  first  psuedo- range  equation, 
3 


to  obtain  a quadratic  equation  in  b, 


I 


xij ' (ajb  + ci> 


= (r  1 - b)' 


j = l L 

which  can  be  solved  using  the  quadratic  formula.  The  user's  position,  u,  can 
then  be  found  by  substituting  back  into  the  relation 


u = ba  + c 

This  method  of  solution  requires  the  inversion  of  a 3 x 3 matrix  and  a square 
root  operation  for  each  position  fix. 

The  data  filter  may  take  many  forms.  One  possible  implementation  is 


_f(n)  = T(n,n-1)  f(n-l)  + G(n)u 

where 

fjn)  ~ filtered  position  output  at  time  n 
T(n,n-1)  = transition  matrix  from  n-1  to  n 
G(n)  = gain  matrix 
u = current  solution 


34 


' , ■ — I ■ 


I 


t 

I 


I 

f 


This  filter  mechanization  linearly  combines  the  previous  filter 
output  with  the  new  solution  point.  The  matrices  may  be  variable  quantities 
which  depend  on  the  system's  estimate  of  the  quality  of  the  data. 

The  computer  requirements  for  the  GPS  application  are  sum- 
marized in  Table  9.  The  memory  storage  estimates  are  shown  in  detail  in 
Table  10.  The  throughput  and  I/O  estimates  are  moderate  due  to  the  rela- 
tively long  time  (~6  s)  between  position  inputs.  In  addition  to  standard 
trigonometric  and  square  root  subroutines,  a matrix  inversion  procedure  is 
required.  Double  precision  arithmetic  is  required  if  a 16  bit  processor  is 
used. 

The  computer  requirements  have  been  shown  for  two  types  of 
processors.  The  first  machine  has  an  eight  bit  instruction  and  a 32-bit  data 
word.  An  example  of  this  type  of  machine  is  a modified  CRD-8.  The  CRD-8 
is  a General  Electric  design  of  an  8-bit  microcomputer  that  has  been  con- 
structed using  SSI  and  MSI  TTL.  For  this  application,  the  arithmetic- register 
section  of  the  Central  Processing  Unit  (CPU)  would  be  extended  from  eight  to 
32  bits.  When  an  instruction  is  read  from  memory,  only  the  upper  eight  bits 
of  the  32  bit  data  path  are  used  to  decode  the  operation.  All  arithmetic  opera- 
tions use  the  full  32  bits.  Memory  space  can  be  minimized  by  dividing  the 
memory  into  separate  instruction  and  data  sections.  However,  the  user  is 
still  limited  to  an  8 bit  instruction. 

The  second  processor  is  a 16-bit  machine  with  double  precision 
arithmetic  capability.  The  16  bit  instruction  word  has  added  capability  over  an 
eight  bit  instruction  at  the  cost  of  additional  memory.  The  throughput  require- 
ment is  higher  for  this  machine  since  more  operations  are  required  to  arrive 
at  a 32  bit  answer.  The  DEC  LSI- 11  is  a typical  microprocessor  based 
machine  which  could  be  applied  to  this  application.  The  LSI- 11  can  emulate 
the  PDP11-40  minicomputer  and  it  has  a floating  point  option.  Extensive 
software  support  is  also  available. 
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TABLE  9.  GPS  COMPUTER  REQUIREMENTS 


Memory  Size 
Throughput 
I/O  Rate 
Word  Size 
Instruction 
Data 

Double  Precision 
Subroutines 

Instruction  Set 


Address  Mod 
Options 


10,240  (8  bit) 

6144  (16  bit) 

100  KOPS 

300  KOPS 

100  Kbps 

100  Kbps 

8 

16 

32 

16 

No 

Yes 

Trig 

Square  Root 
Matrix  Inverse 

General  Reg  Architecture 

Arithmetic/Logic 

Mult/Div 

Subroutines 

Branch 

Stack 

Direct,  Indirect,  Index 
Stack,  Register  Pointer 

Floating  Point 


Ik 


TABLE  10.  MEMORY  STORAGE  REQUIREMENTS 


8-Bit  16  Bit 

Instruction  Instruction 


Program 


User  Input 

128 

96 

Select  Four 

256 

160 

RCVR  Control 

512 

425 

User  Position 

512 

425 

Filter 

1024 

800 

Output 

256 

256 

Control 

512 

425 

Subroutines 

512 

425 

Conversions 

256 

150 

Spec.  User  Calc. 

512 

400 

Growth,  Misc. 

1664 

534 

Constants 

512  (32  bits) 

1024 

Scratch  pad 

512  (32  bits) 

1024 

6144  ( 8 bits) 

6144  (16  bits) 

1024  (32  bits) 

10,  240  ( 8 bit  bytes) 

II 

i 
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SECTION  m 
TECHNOLOGY  STUDIES 


A.  PERFORMANCE  CHARACTERISTICS 


The  electrical  performance  characteristics  of  the  microprocessor 
candidate  technologies  are  summarized  in  Table  11,  and  are  discussed  in 
the  following  paragraphs.  Commercial  microprocessors  are  available  in  all 
technologies  except  dielectrically  isolated  Transistor-Transistor  Logic  (TTL). 

1.  Emitter  Coupled  Logic  - ECL 

The  on-chip  propagation  delay  for  ECL  10000  is  nominally  2.  5 ns 
which  can  rise  to  3.  7 ns  over  temperature.  The  off-chip  delay  of  3.  5 ns  is 
with  a 50  pf  load.  The  power  dissipation  includes  the  terminating  resistor 
dissipation  in  the  off-chip  case.  The  second  power  supply,  -2V,  is  re- 
quired for  the  terminating  resistors.  Power  dissipation  is  essentially  con- 
stant over  the  military  temperature  range  and  independent  of  speed  from 
1 - 100  MHz.  The  high  level  noise  immunity  is  0. 125V  and  the  low  level 
noise  immunity  is  0. 150V  at  25° C.  Off-chip  terminating  resistors  are  re- 
quired to  prevent  line  ringing.  The  logic  levels  for  the  technology  are  -0.  9 
and  -1.8V.  The  negative  supply,  which  represents  a departure  from  normal 
saturated  logic  operation,  is  used  to  minimize  the  effects  of  noise  that  appears 
on  the  voltage  supply  line,  Vee  in  this  case.  Normally,  the  ground  bus  has 
the  least  amount  of  noise  in  a system.  The  logic  output  levels  are  closely 
associated  with  the  Vqq  terminals,  and  if  minimum  noise  exists  on  this  line, 
little  effect  will  be  noted  on  the  signal  waveshapes.  Noise  appearing  on  the 
Vee  supply  will  be  attenuated  by  bout  75%  due  to  the  resistor  ratios  that 
exist  in  the  circuit.  Thus,  noise  that  exists  on  the  V^e  line  will  not  have 
nearly  the  effect  on  the  signals  as  equivalent  noise  on  the  Vqq  lines. 

A positive  5 V power  supply  can  be  used  if  care  is  taken  to  mini- 
mize noise  on  the  +5V  supply  line.  Translators  are  available,  however,  to 
interface  between  negative  ECL  levels  and  positive  saturated -logic  levels, 
and  it  is  normally  desirable  to  operate  in  this  mode.  The  seemingly  arbitrary 
value  of  5.2V  was  selected  to  optimize  speed/power  relationships  and  to 
allow  three  levels  of  differential-amplifier  operation  for  increased  logic 
efficiency. 

The  logic  swing  exhibited  by  ECL  is  significantly  smaller  than  that 
of  saturated  logic  and  may  seem  undesirable  to  work  with.  Saturated  logic  de- 
signers are  accustomed  to  typical  TTL  logic  swings  of  approximately  3 to  3.5  V, 
while  those  of  ECL  are  on  the  order  of  850  mV.  However,  for  high-speed 
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operation,  smaller  signal  swings  are  beneficial  from  several  standpoints. 
Advantages  are  that  properties  of  dV/dt  and  dl/dt  are  minimized  to  cut  down 
crosstalk  between  lines.  Further,  with  smaller  signal  swings,  transmission 
lines  are  much  easier  to  handle.  The  use  of  transmission  lines  can  be  very 
beneficial  at  higher  speeds. 


ECL  is  a mature  technology.  The  fabrication  process  requires 
seven  masks  and  five  diffusions.  Two  layers  of  interconnect  are  available: 
one  aluminum  layer  and  one  for  diffused  cross-unders. 

The  first  ECL,  bit-slice  microprocessor  was  recently  announced 
by  Motorola.  The  power  and  silicon  area  requirements  of  the  technology  have 
undoubtedly  contributed  to  the  time  span  between  the  ECL  and  MOS  product 
announcements.  The  M10800  microprocessor  family  is  microprogram mable, 
and  fast  75  ns  instruction  cycle  times  are  claimed.  Sample  quantities  are 
scheduled  for  early  1976. 

2.  Schottky  TTL 

Adding  Schottky  barrier  diodes  to  the  saturating  transistors  in  a 
basic  TTL  gate  reduces  stored  charge  in  the  transistors,  resulting  in  higher 
speed  operation.  Internal  gates  in  anSTTL  chip  can  use  fewer  transistors  than 
interface  gates,  resulting  in  lower  power  dissipation  and  less  silicon  area. 

The  technology  is  mature,  although  product  lines  with  different  speed  power 
products  are  available  from  various  manufacturers.  The  process  requires 
seven  masks  and  four  diffusions,  and  two- layer  interconnect  is  available. 

3.  Integrated  Injection  Logic  - IIL 


IIL  is  a recent  bipolar  technology  development  which  offers  high 
packing  density  and  a wide  range  speed-power  tradeoff.  An  example  of  the 
packing  density  capability  of  IZL  is  shown  in  Figure  14.  The  p-injector  used 
to  power  the  flop  is  indicated  along  the  drawing's  horizontal  axis.  Note  that 
an  entire  #L  static  flip-flop,  which  would  require  six  4-by-4  mil  bonding  pads 
interfaced  to  the  outside  world,  would  virtually  fit  under  one  of  those  bonding 
pads.  It  is  this  compact  geometry  that  accounts  for  the  high  component  density 
of  the  l2L  LSI  processor  designs  currently  in  production.  Practical  IZL  gates 
can  handle  a range  of  six  magnitudes  or  more  of  injector  current  - from  pico- 
amperes  to  microamperes  - at  speeds  ranging  from  hundreds  of  microseconds 
to  tens  of  nanoseconds.  They  can  be  powered  up  for  maximum  speed,  then 
powered  down  by  a magnitude  of  100  to  10, 000  without  losing  functions  or  data 
(if  they  are  memories).  They  do  not  display  increased  power  dissipation  with 
frequency  nor  produce  the  switching  noise  transients  common  in  standard 
push-pull  logic  built  with  CMOS  or  TTL.  They  require  neither  gold  doping  nor 


40 


..v- ...  - ..  ... 


D TYPE  FF  USING  STD  NAND  GATE 


Figure  14.  IIL  Design  Example 
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Schottky  clamping,  as  does  conventional  logic  in  which  transistors  are  easily 
saturated. 


2 

I L logic  is  fully  static,  requiring  no  multiphase  clocks.  Tem- 
perature stability  is  superior,  with  circuits  capable  of  military  temperature 
range  operation  from  -55°C  to  125°C. 

Full  TTL  compatibility  can  be  achieved  with  the  circuits  shown 
in  Figure  15.  A nominal  input  threshold  of  1.  5V  can  be  achieved  with  two 
10K  Q,  resistors  functioning  as  a voltage  divider. 

These  high-impedance,  high-threshold  characteristics  were  chosen 
to  reduce  input  loading  and  to  increase  the  input  noise  margin  over  a standard 
TTL  input,  yet  they  retain  full  compatibility  with  all  5-V  logic  families.  The 
I2L  inputs  also  utilize  an  input- clamping  diode  to  limit  negative  excursions,  or 
ringing,  on  the  receiving  end  of  a transmission  line. 

Standard  design  practice  for  I2L  is  again  patterned  after  TTL  in 
that  I2L  outputs  are  generally  over-designed  by  100%.  While  a typical  i2l  out- 
put will  sink  40  mA  without  pulling  out  of  saturation,  the  outputs  of  i2l  logic 
circuits  are  guaranteed  to  sir k 20  mA  (10  Schottky  TTL  loads)  at  400  mV 
maximum  under  worst-case  conditions.  The  high  logic  output  level,  output 
rise  times,  and  input  noise  immunity  are  determined  by  the  discrete  pull-up 
resistor. 

o 

I L is  relatively  easy  to  process,  requiring  only  four  masks  and 
two  diffusions.  Although  a commercial  logic  family  is  not  available,  the  high 
packing  density  makes  the  technology  a natural  for  LSI.  The  Texas  Instru- 
ments microprocessor,  SBP0400,  contains  more  than  1450  gates. 

4.  Complementary  MOS  - CMOS 

The  on-chip  propagation  delay  in  CMOS  circuits  is  dependent  on 
the  gate  capacitive  loading.  Power  dissipation  is  dynamic  in  nature,  depend- 
ing directly  on  capacitive  load  and  switching  frequency.  Static  power  dissipa- 
tion is  very  low,  which  makes  CMOS  very  attractive  for  low  speed,  battery 
powered  applications.  Noise  immunity  is  excellent  with  30%  of  the  supply 
voltage  guaranteed.  The  CMOS,  silicon  gate  process  is  a mature  technology; 
processing  requires  six  masks  and  three  diffusions. 

Sapphire  can  also  be  used  as  a substrate  material  for  CMOS  cir- 
cuits, instead  of  the  conventional  bulk  silicon.  The  great  advantage  of  CMOS 
on  SOS  is  that  it  can  be  built  into  configurations  that  are  two  to  three  times 
denser  than  bulk  silicon  circuits,  with  two  to  three  times  more  speed.  More- 
over, power  dissipation  at  higher  speeds  is  greatly  reduced,  yielding  a 


42 
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cults  suitable  both  for  LSI  logic  armlir^Hnn  * ■ lgh'PeJ'f°rminS  family  of  cir- 
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with  the  bipoUr  tectaologieTsabIl  thl  Sh°W  that,the  hlghest  s^e6  ls  obtained 
polar  LSI  has  been  ‘ the  eXpfnse  of  P°«r  and  silicon  area.  Bi- 

Whereas  a 4K  bit  RAM  in  i?  aPPJoximately  one  fourth  the  size  of  MOS  LSI 
a"  ECL  a 256  hk  ™ m ""  dem°nstra“*1  "ith  MOS,  bipolar  Schottky  Ls 
Schottkv  alkiwed  a H ■?  , 0ry  delrlce-  The  fatroduction  of  low-power 
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features  of  high  speed,  go^  density 

tary  temperature  raifge^ceramichpackae^n  ^00^  °n  * tW°  inp“l.  gate’  mili' 

no  costs  are  available  for  t2t  fho^  ■ g n 100  or  more  quantities.  Although 
competitive  with  the  other  terhnnWi 18  no  ^eason  to  suspect  that  it  would  not  be 
dielectrically  isolated  TTT  f es*  ,^be  most  glaring  cost  entry  is  for 

technologies  ’ aPPro!“mately  aix  times  the  gate  cost  of  the  other 

B.  RADIATION  PERFORMANCE 

exam ineLLf tL'LrLrm^ce" leveT fb!rlar "eld  e«aot  devices  were 
marized  in  Table  12  SunnorMno-  nata  t e ™rious  technologies  are  sum- 
obtained  from  references  wilhMw?  v ? th!  ?vels  siven  ln  Table  ™ were 

s^rarN£iS  3f 

gra^^lLu^Um  daii^e^ mechan^s^s'and^^^^iati^^1^!^0^^0^'3'"0^10 

various  technologies  in  detail.  d the  radiation  Performance  of  the 

1 . Bipolar  Technologies 

a.  Damage  Mechanisms 

Irradiation  of  bipolar  transistors  causes: 

(1>  fn^c®  charge  accum“lktion  in  the  passivation  layer  and 
interface  state  generation, 
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(2)  majority  carrier  removal  at  the  radiation  induced  defects , 

(3)  ionization  of  the  semiconductor  material  in  the  vicinity  of 
the  circuit  p-n  junctions. 

Circuit  performance  degrades  due  to: 

(1)  transistor  gain  degradation, 

(2)  bulk  silicon  resistivity  increase, 

(3)  transient  photocurrent  generation  causing  transient  or 
possibly  permanent  circuit  malfunction. 

Three  different  factors  contribute  to  transistor  gain  degradation  in 
bipolar  circuits.  The  first  two  occur  as  a result  of  neutron  bombardment. 

The  first  mechanism  is  a reduction  in  the  minority  carrier  lifetime  in  the 
base  region.  In  order  for  the  transistor  to  remain  functional,  the  base  tran- 
sit time  must  be  much  shorter  than  the  minority  carrier  lifetime.  As  the 
neutron  fluence  increases,  the  minority  carrier  lifetime  approaches  the  base 
transit  time,  eventually  reducing  the  gain  of  the  transistor  to  zero.  A second 
mechanism  is  the  increase  in  base  current  resulting  from  increased  carrier 
recombination  in  the  emitter  space  charge  region.  Normally,  this  mechanism 
is  dominant  only  at  low  injection  levels.  A third  mechanism  is  the  increased 
surface  recombination  resulting  from  an  increase  in  interface  state  density. 

Both  of  these  latter  mechanisms  are  especially  important  at  high  fluences 
( >1013  N/cm2)  for  IIL. 

Gain  degradation  with  neutron  fluence  is  shown  in  Figure  16  for 
several  transistors  with  different  gain- band  width  products  (fx).  Transistors 
with  high  fT  usually  have  smaller  geometries  and  shorter  base  widths,  so  that 
a higher  fluence  is  required  to  make  the  minority  carrier  lifetime  approach  the 
base  transit  time. 

Defects  formed  by  neutron  bombardment  in  bulk  silicon  tend  to  act  as 
trapping  and  recombination  cenh>rs,  thereby  increasing  the  bulk  resistivity. 

The  effective  majority  carrier  concentrations  tend  to  decrease  and  approach 
the  intrinsic  level  at  a rate  oroportional  to  the  neutron  fluence.  The  bulk 
resistivity  is  inversely  related  to  the  doping  level,  so  that  bulk  resistivity 
increases  with  neutron  fluencr  An  example  of  this  effect  is  shown  in  Fig- 
1116  l^j  A0  which  the  initial  donor  concentration  in  the  n-type  material  was 
2.  5 x 1015  cm-3,  corresponding  t<  a resistivity  of  2 ohm-centimeter.  The 
material  becomes  intrinsic  with  a resistivity  of  2.2  x 105  ohm-cm  at  a critical 
neutron  fluence  of  approximately  5 x 10*4  neut/cm2.  A minimum  doping  con- 
centration can  be  determined  if  the  neutron  fluence  for  the  specified  environment 


Neutron  Fluence  Effect  on  Resistivity 


is  known.  For  example,  if  the  specified  fluence  level  is  1014  neut/cm2,  the 
minimum  donor  level  is  just  under  1015  cm- 3 in  vacuum  float  zone  silicon 
and  approximately  5 x 10*4  cm-3  in  Czochralski  silicon. 

High  levels  of  ionizing  radiation  generate  electron  hole  pairs  in  the 
vicinity  of  each  p-n  junction,  resulting  in  transient  currents  which  can  charge 
and  discharge  circuit  nodes.  The  transient  current  is  proportional  to  the  p-n 
junction  area  and  the  transient  dose  rate.  Computer  models  can  be  used  to 
simulate  the  effects  of  transient  radiation  in  bipolar  and  field  effect  circuits. 

b.  Logic  Family  Performance 
(1)  Emitter  Coupled  Logic 

There  are  several  potential  problems  associated  with  the 
use  of  ECL  circuits  in  a radiation  environment. 

(a)  They  are  not  available  off-the-shelf  in  a dielectrically 
isolated  form,  even  in  small  scale  integrated  circuits. 

(b)  The  input  base  resistor  is  high  (see  Figure  18 ) so  that 
a relatively  low  photocurrent  through  the  resistor  will 
bias  the  input  transistor  on,  possibly  resulting  in  an 
erroneous  output. 

(c)  All  transistors  operate  in  their  linear  region,  making 
the  circuit  more  susceptible  to  hpg  degradation. 

(d)  The  logic  levels  are  separated  by  only  800-900  mV 
and  the  relatively  low,  0. 125  V,  noise  immunity 

is  likely  to  be  aggravated  under  radiation  exposure. 

The  MC  1679L,  a four  bit  MECL  III  counter,  has  been  tested 
in  various  radiation  environments 4 . The  circuit,  shown  in  Figure  19,  can 
perform  the  divide  by  two,  divide  by  five,  or  divide  by  ten  functions. 

A divide  by  ten  configuration  was  used  in  FXR  and  LINAC 
tests;  the  lowest  failure  level  was  obtained  when  all  bits  of  the  counter  were  in 
a logic  "1"  state.  When  exposed,  the  "l's"  changed  to  "0's"  at  a level  of 
3 x 108  Rads(Si)/s  for  all  seven  samples  tested.  No  failures  were  observed  at 
2 x 108  Rads(Si)/s  and  identical  performance  was  obtained  for  both  static  and 
dynamic  operations  (920  MHz  clock  rate).  In  the  wide  pulse  (4  jus)  LINAC  tests, 
the  devices  failed  by  dropping  "1"  between  1 and  2x10°  Rads(Si)/s,  and  when 
operated  at  a 2.5  MHz  rate,  they  failed  to  count  between  1.1  - 1.4  108  Rads(Si)/s. 
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Figure  19.  MC  1678  Counter 


The  circuit  also  exhibited  unexpectedly  long  recovery  times. 
After  exposure  to  10^  Rads(Si)/s,  the  circuit  did  not  resume  counting  cor- 
rectly until  1 pts  after  the  burst.  When  exposed  to  a 4. 5 /is  LINAC  pulse  at 
2 x 1()9  Rads(Si)/s,  the  circuit  required  14  /is  to  recover.  No  catastrophic 
damage  was  observed  at  the  GAMBLE  n test  facility  at  a dose  rate  up  to 
1.  5 x lOH  Rads(Si)/s,  the  highest  actually  attained. 

The  circuit  operated  amazingly  well  in  the  neutron  environ- 
ment, to  which  several  were  exposed.  The  observed  failure  level  was  1. 9-2.5 
x 1()15  N/cm2.  In  failing,  the  logic  level  separation  decreased  from  900  mV  to 
300  at  1.  9 x 10*5  N/cm*,  and  the  circuit  stopped  counting,  in  some  cases 


abruptly. 


(2)  Emitter  Follower  Logic 


Emitter  Follower  Logic  operates  at  medium  speeds  (2-40 
MHz),  and  was  initially  developed  by  TRW  and  is  now  being  built  by  Motorola. 
Devices  are  not  yet  commercially  available.  Table  13  shows  a list  of  circuits 
now  under  development  at  Motorola^  . 


TABLE  13.  EFL  CIRCUIT  TYPES 


♦Gate  equivalency  is  derived  by  dividing  the  device  count  by  3.  Three  being  the  number  chosen  as 
components  per  3 input  nand  gate. 


The  3D,  EFL  structure^  is  shown  in  Figure  20  and  is 
responsible  for  the  favorable  response  of  EFL  in  a radiation  environment. 
The  circuit  uses  a p-type  substrate,  vertical  NPN  and  PNP  transistors  and 
buried  N-type  resistors.  Since  the  p-type  emitters  in  the  PNP's,  the 
p bases  in  the  NPN's  and  the  p region  overlying  the  n-type  resistors  are 
heavily  doped  at  the  surface,  the  circuit  is  effectively  immune  to  surface 
effects  in  SiC^. 


The  transistors  employ  relatively  wide  bases  which  re- 
sult in  large  transistor  gain  degradation  under  neutron  bombardment.  How- 
ever, the  design  rules  allow  operation  of  the  PNP's  with  gains  of  2 and  NPN's 
with  gains  of  3 before  loss  of  logic  margin  occurs. 

Various  unhardened  LSI  EFL  circuits  have  been  radiation 
tested  with  the  following  results  ^ : 

(a)  At  4 x 10^  Rads(Si)  under  cobalt-60  exposure,  the  cir- 
cuit remains  functional  with  no  loss  in  logic  margin. 

At  this  level,  the  maximum  frequency  of  operation  is 
improved  approximately  5 to  10%,  as  are  the  per- 
missible power  supply  variations.  The  device  betas 
have  degraded  to  approximately  3 for  PNP's  and 

13  for  NPN's  at  this  level.  The  ultimate  failure  dose 
has  not  been  determined. 

(b)  Operational  errors  are  made  at  a dose  rate  of  4 x 10® 
Rads(Si)/s.  The  level  at  which  an  operating  circuit 
ceases  to  function  during  the  radiation  burst  is  about 
5 x 109  Rads(Si)/s.  The  outputs  are  saturated  at  the 
ground  state.  At  9 x 1010  Rads(Si)/s  saturation  at  the 
ground  state  persists  for  approximately  1 ps  after  the 
radiation  burst.  Thereafter,  it  recovers  to  normal 
operation. 

4 A O 

(c)  At  a cumulative  fluence  of  101’  N/cm  , an  operating 
circuit  continues  to  operate  without  loss  of  logic  mar- 
gin. The  maximum  frequency  of  operation  is  observed 
to  be  approximately  30%  higher  than  before  exposure. 
At  2 x 1015  N/cm2  the  circuit  ceases  to  function,  but 
operational  recovery  is  achieved  by  thermal  annealing 
at  150°C  for  24  hours. 

(d)  Examination  of  the  circuit  structure  may  indicate  the 
possibility  of  PNPN  latch-up  in  the  NPN  transistor 
regions.  However,  this  has  not  been  reported. 


(3)  Integrated  Injection  Logic 

IIL  has  a very  attractive  speed-power  product,  and  it  is  a 
high  density  bipolar  technology  which  allows  digital  and  analog  circuits  to  co- 
exist on  the  same  chip.  The  structure  and  a simple  logic  gate  are  shown  in 
Figure  21.  The  IIL  gate  has  a single  input  and  multiple  outputs,  in  contrast 
to  the  multiple  input,  single  output  TTL. 

Radiation  effects  in  IIL  tend  to  decrease  the  PNP  alpha  and 
also  tend  to  invert  the  p- regions  at  the  surface  because  of  positive  charge 
storage  in  the  SiO«.  However,  this  latter  effect  is  of  minor  importance  since 
the  impurity  concentration  in  the  p- region  is  rather  high  at  the  surface. 

The  generation  of  interface  state  under  steady  state  ionizing 
radiation  is  a major  source  of  radiation  induced  degradation  in  I^L  circuits. 
The  major  effect  is  a reduction  in  the  gain  of  the  PNP  transistors  as  shown  in 
Figures  22  and  23  for  both  the  PNP  and  NPN  devices  respectively. 

i2l  circuits  are  also  degraded  by  neutron  exposure  in  the 
same  manner  as  other  bipolar  devices.  Experiments  carried  out  by  Raymond 
et  al.®  have  shown  that  failure  occurs  in  the  region  of  3 x 10*3  N/cm2  (on  the 
basis  of  one  device).  However,  severe  degradation  in  h™,  and  "alpha"  occurs 
in  the  region  below  10^2  N/cm^  as  shown  in  Table  14.  Operation  of  the  five- 
stage  ring  oscillator  showed  a reduction  in  frequency  as  a result  of  the  transi- 
ster  gain  degradation. 

The  results  of  the  flash  x-ray  exposure  are  shown  in  Fig- 
ure 24.  First,  it  can  be  seen  that  latch-up  did  not  occur.  The  maximum 
potential  drop  is  one  diode  drop  (approximately  0.  8V)  and  there  is  no  mechan- 
ism known  that  will  support  latch-up  at  such  a low  voltage.  The  data  shows 
that  the  ring  counter  is  affected  by  dose  rates  above  5 x 10®  Rads(Si)/s.  The 
counters  simply  increase  frequency  rather  than  skipping  a beat.  However,  a 
partial  recovery  of  the  initial  operating  frequency  is  seen  in  the  latter  portions 
of  each  record  shown  in  Figure  24. 

(4)  Transistor- Transistor  Logic 

Hardened  versions  of  many  SSI  TTL  functions  are  available 
which  incorporate  gold  doping,  dielectric  isolation,  metal  film  resistors,  low 
Z packaging  techniques  and  photocompensation  diodes  in  the  circuit  design. 
These  circuits  have  been  extensively  tested  and  are  known  to  operate  at  dose 
rates  up  to  109  Rad(Si)/s  . Many  operate  up  to  1010  Rads(Si)/s)  at  full  speci- 
fied fan-outs  of  10.  The  most  notable  exception  is  the  RSN  54L122  Monostable 
Multivibrator,  for  which  the  failure  level  is  about  5 x 10  Rads(Si)/s.  All 
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Figure  24. 


I^L  Ring  Counter  Photoresponse 


TABLE  14.  I2L  TEST  CHIP  NEUTRON- INDUCED  DAMAGE 


Transistor  Common-Emitter  Gain,  hpg. 


Neutron 

Fluence 

$,N/cm2 

Sample  #6 

Sample  #7 

Lateral 

Small 

Large 

Lateral 

Small 

Large 

PNP 

NPN 

NPN 

PNP 

NPN 

NPN 

0 

0.  53 

8.6 

25 

0.49 

7.1 

21.2 

4.7  x 1012 

0.26 

5.2 

13.3 

0.27 

4.6 

12.4 

7.4  x 1012 

0.24 

4.0 

5.6 

0.24 

3.6 

5.6 

Ring  Counter  Frequency,  I£E  = 1 mA 

Sample 

$ = 0 

$ = 4.7  x 10*2  N/cm^ 

U 

1.25  MHz 

0.  74  MHz 

#5 

0.83  MHz 

damaged  in  test 

#6 

1.09  MHz 

damaged  in  test 

#7 

0.  95  MHz 

damaged  in  test 

\ 

\ 
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devices  operated  satisfactorily  at  neutron  fluences  up  to  1 x 1014  N/cm^  with 
acceptably  small  changes  in  output  pulse  levels  and  widths,  with  the  excep- 
tion of  the  RSN  54  L122,  for  which  the  pulse  width  decreased  by  50-90%,  and 
the  RSN  54  L122  and  the  RSN  54  L71,  for  which  the  saturation  voltage  increased  to 
about  1.  0V.  All  circuits  remained  functional  at  total  doses  up  to  and  above 
107  Rads(Si)  in  a steady  state  ionizing  radiation  environment  with  a 35-45% 
increase  in  rise  time  and  no  change  in  fall  time.  Other  testslO  using  shallow 
planar  geometries  show  significant  degradation  of  the  transistor  gain  at  doses 
between  2x10^  and  1 x 10°,  considerably  lower  than  the  levels  given  above. 

8 10  11 12 

Large  scale  integrated  circuit  radiation  effect  studies  ’ ’ ’ 

are  also  underway.  In  these  programs,  the  effort  is  aimed  at  ascertaining 
the  radiation  effects  and  failure  levels  for  existing  unhardened  and  hardened 
microcircuits  now  under  development.  The  failure  levels  obtained  by  Ray- 
mond et  al.Hare  shown  in  Table  15. 

2 . Field  Effect  Technologies 
a.  Damage  Mechanisms 

The  primary  radiation  effects  in  FET's  are  well  known  and 
result  from  both  the  transient  and  steady  state  ionizing  radiation.  The  effect 
of  transient  radiation  pulses  is  the  generation  of  photocurrents  at  each  p-n 
junction  in  the  circuit.  The  current  may  be  reduced  and  the  tolerance  of  the 
circuit  may  be  increased  by  as  much  as  100  times,  by  placing  the  circuit  on 
sapphire,  one  silicon  island  for  each  transistor.  The  result  of  this  procedure 
Is  the  reduction  of  the  p-n  junction  area  by  about  100  times  for  a typical 
circuit. 


The  second  equally  important  effect  occurs  under  steady  state 
ionizing  radiation.  Electron-hole  pairs  are  created  in  the  gate  insulation  and 
since,  in  SiC>2  more  hole  traps  exist  than  electron  traps,  the  oxide  accumulates 
positive  charge,  moving  the  flat-band  and  thus  the  threshold  voltage  in  the 
negative  direction.  An  additional  effect  occurs  in  CMOS/SOS  circuits.  The 
sapphire  also  accumulates  positive  charge  which  tends  to  create  a channel  in 
the  n- channel  transistor  at  the  silicon/sapphire  interface  which  is  manifested 
as  an  uncontrolled  leakage  current.  In  PMOS  and  PMNOS  circuits,  threshold 
shifts  for  identical  transistors  tend  to  be  somewhat  higher  in  magnitude,  so 
that  circuits  that  will  tolerate  3 x 106  Rads  (Si)  on  bulk  silicon  will  tolerate 
only  3 x 10^  Rads(Si)12  when  implemented  on  sapphire.  The  decrease  in  total 
dose  hardness  also  seems  to  be  true  for  CMOS/SOS  4 3,  The  threshold  shift  is 
strongly  dependent  upon  the  bias  applied  during  exposure. 
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TABLE  15.  MSI/LSI  BIPOLAR  VULNERABILITY 


The  procedures  for  hardening  MOS  devices  in  general  are 
listed  in  Table  16.  Sapphire  will  drastically  reduce  junction  area  and  in- 
crease transient  hardness.  Various  processing  techniques  can  be  used  to 
decrease  threshold  voltage  sensitivity  to  total  dose.  Design  tradeoff  be- 
tween speed  and  power  can  also  be  made  to  increase  hardness. 


TABLE  16.  MOS  HARDENING  TECHNIQUES 


• Use  sapphire  to  reduce  p-n  junction  area  and  transient 
photocurrents 

• Use  an  "ultra  pure"  gate  oxide  with  aluminum  ion  implant  in 
the  n channel  insulator  followed  by  a high  temperature  anneal 

• Use  aluminum  oxide  for  the  gate  insulator  to  provide  electron 
as  well  as  hole  traps.  This  technique  seems  to  produce  un- 
stable devices  that  exhibit  threshold  voltage  changes  with 
applied  voltage 

• Use  a chrome  doped  gate  insulator 

• Trade  off  speed  and  power  for  radiation  hardness 

b.  Logic  Family  Performance 

(1)  CMOS  and  CMOS /SOS 

In  one  standard  CMOS  process,  both  the  n and  p channel 
transistors  are  enhancement  mode  devices.  Under  radiation,  the  n-channel 
device  can  become  a depletion  mode  device,  affecting  the  proper  logical 
operation  of  the  circuit. 

Figure  25  is  an  example  of  the  effect  of  total  dose  radia- 
tion on  CMOS  with  an  AI2O3  gate  oxide^.  Similar  transfer  curves  for  the 
other  gate  insulator  have  been  obtained^. 

(2)  PMOS  and  PMOS/SOS 

PMOS  circuits  are  adversely  affected  by  steady  state  ion- 
izing radiation  because  of  the  shift  in  threshold  due  to  oxide  charging  and 
interface  state  creation.  In  order  to  reduce  the  p-n  junction  photocurrents, 
circuits  may  be  fabricated  on  sapphire.  This  brings  in  an  additional  possible 
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failure  mechanism,  source-to-drain  leakage  current.  Reference  data  12  shows 
the  leakage  current  in  a typical  PMOS  device  on  sapphire  is  initially  low  and 
that  it  remains  low  (in  some  cases  actually  decreases)  at  doses  up  to  3 x 10^ 
Rads(Si).  Figure  26  shows  the  threshold  voltage  of  a typical  transistor, 
indicating  a 2V  shift  at  3 x 106  Rads(Si).  These  are  typical  values  for  hard- 
ened devices  using  the  "Ultra  Pure"  Aubuchon  oxide,  a process  that  can  be 
used  even  for  devices  on  sapphire. 

Microcircuits  have  also  been  tested  12  using  dynamic 
PMOS  circuits  on  sapphire,  as  well  as  bulk  PMOS  circuits.  The  observed 
failure  levels  are  shown  in  Table  12. 

(3)  NMOS  and  MOS/SOS 

NMOS  devices  can  be  operated  in  two  modes  - either  in 
their  enhancement  mode,  such  as  CMOS  circuits,  with  positive  gate  threshold 
voltages  or  in  their  depletion  mode  with  attendant  negative  threshold  voltages 
as  used  in  modern  high  speed  NMOS  microcircuits.  No  radiation  test  was  un- 
earthed during  this  study  for  high  speed  NMOS  devices.  The  hypothesis, 
however,  is  that  since  the  thresholds  are  negative  and  since  they  will  change 
in  the  negative  direction  as  in  PMOS  devices,  such  devices  should  behave  in 
much  the  same  way  as  PMOS  devices  and  should  become  as  hardened  using 
the  pure  oxide  approach. 

C.  HYBRID  PACKAGING 

The  design  and  specification  of  multichip  hybrid  packages  requires  many 
tradeoffs.  A detailed  treatment  of  the  many  considerations  are  given  in  the 
references.  Some  general  comments  and  a few  examples  are  given  in  the 
following  paragraphs,  along  with  a description  of  a hybrid  packaged  micro- 
processor system. 

1.  General  Comments 

The  methods  of  bonding  the  chip  to  the  substrate  may  be  grouped  into 
face-up  and  face-down  classes.  Chip  and  wire  methods  are  accepted,  industry 
wide  procedures,  while  beam  leads  and  bumps  represent  recent  bonding  tech- 
niques. Face-up  mounting  allows  higher  chip  power  dissipation  due  to  the 
lower  thermal  resistance  to  the  package,  and  the  chip  area  is  usually  smaller 
with  this  bonding  technique. 

Interconnection  wiring  within  the  package  may  be  accomplished  with 
single  or  double-sided  substrates,  as  well  as  multilayer  arrangements.  Single 
layer  limitations  are  interconnection  area  and  the  availability  of  cross-overs. 
Multilayer  interconnection  requires  additional  material  and  more  complicated 


$ 


processing.  Substrate  design  tradeoffs  between  conductor  and  dielectric 
material,  conductor  width  and  thickness  will  affect  electrical  performance 
such  as  inductive  and  capacitive  coupling,  characteristic  line  impedance, 
crosstalk  and  propagation  time. 

2.  Examples 

The  following  examples  of  multichip  packaging  were  obtained  from 
trade  literature  and  represent  the  kind  of  systems  applicable  to  this  contract. 

a.  CMOS  Memory  for  MJS  Spacecraft 

Algorex  Data  Corporation  is  supplying  a thick  film  hybrid  mem- 
ory circuit  to  the  Jet  Propulsion  Laboratory  for  use  on  the  Mariner  spacecraft, 
which  will  be  launched  in  1977  to  rendezvous  with  Jupiter  and  Saturn.  The 
memory  will  consist  of  four,  3x3  inch,  four-layer  hybrid  circuits,  each  con- 
taining 2048  16-bit  words.  Each  hybrid  contains  140  chips:  128  CMOS  memory 
chips  and  12  driver  chips.  The  memory  chips  are  RCA  CD4061  static  RAMS 
with  a 256  x 1 organization.  The  four  hybrids  do  the  job  of  two  7 x 14  inch 
multilayer  boards,  and  the  cost,  in  the  low  hundred-thousand-dollar  range, 
is  reportedly  half  the  conventional  approach. 

b.  Missile  CPU 

The  same  company  is  also  building  a 16-bit  central  processing 
unit  in  hybrid  form  for  a missile  system.  The  CPU  consists  of  75  digital  IC’s 
and  25  discrete  active  and  passive  chips.  The  ceramic  substrate  is  two  inches 
square  and  contains  10  layers.  The  high  speed  processor  operates  with  a 20 
MHz  clock  time. 

c.  Honeywell  CPU 

Honeywell  is  using  hybrid  circuits  in  its  MOD/LSI  line  of 
aerospace  computers.  One  of  the  computers,  the  MOD/LSI-2,  was  chosen  for 
the  Navy's  advanced  tactical  inertial  guidance  system  (ATIGS)  for  small 
missiles.  In  what  may  be  the  same  hybrid  described  above,  approximately 
75  IC's  and  25  discretes  are  combined  on  a two  inch  square  substrate  and 
housed  in  a 120  lead  package.  This  16  bit  processor  uses  TTL  and  low- 
power  Schottky  TTL  technologies.  The  computer  can  perform  250, 000  - 
400,000  operations  per  second,  depending  upon  the  circuit  implementation, 
memory  technology  and  instruction  mix. 
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d.  Microprocessor  System 


Hybrid  packaging  is  used  by  Teledyne  Semiconductor  to  form 
a microprocessor  system.  A 120  lead  package  contains  40  IC's  and  10  dis- 
crete chips.  Flying  leads  connect  the  chips  to  the  multilayer  substrate.  The 
package  photograph  appears  to  contain  eight  memory  arrays  and  eight  proc- 
essor chips.  The  Teledyne  TDY-52B  hybrid  processor  is  described  in  detail 
in  the  following  paragraphs. 

e.  Hybrid  Microprocessor  System 

Teledyne  Systems  Company  is  making  microprocessor  systems 
using  multichip  hybrid  packaging.  The  package  characteristics  of  the  TDY- 
52 B are  given  below: 

43  IC's,  22  Resistors,  6 Caps,  2 Transistors,  5 Diodes 
120  Lead  package,  2"  x 2"  x 0. 2" 

Multilayer  Substrate 

Chip  and  Wire  Interconnect 

Hermetically  Sealed 

Commercial  Components:  0°  to  70°C 

Typical  Power  Dissipation  - 5.4  W 


The  integrated  circuits  are  commercial  temperature  range 
TTL,  and  low  power  Schottky  TTL  SSI  and  MSI,  and  PMOS  slices  for  the  CPU 
and  CROM.  The  integrated  circuits  have  approximately  575  bonding  pads,  so 
that  1400  bonding  operations  are  required  for  chip- substrate  and  substrate- 
package  interconnection.  Only  25  to  33%  of  the  substrate  is  active  chip  area. 

The  characteristics  of  the  16-bit  processor  are  shown  in  Table 
17.  The  processor  uses  the  National  Semiconductor  IMP-16  four  bit  RALU 
slice  and  a 100  word  by  23-bit  ROM  which  implements  the  standard  instruction 
set.  A second  CROM  is  required  for  the  extended  instruction  set  which  in- 
cludes multiply  and  divide  and  double  precision  add  and  subtract. 

A block  diagram  of  the  TDY-52B  is  shown  in  Figure  27.  The 
Clock  Generator  develops  the  necessary  clock  phases  for  the  MOS  clock  drivers 
and  CPU  timing  signals.  There  are  eight  phases  in  the  basic  1.4  /us  micro- 
cycle. TTL  buffers  are  included  in  the  package  to  buffer  the  MOS  RALU  signals. 
The  RALU  devices  are  controlled  by  a four  bit,  time  multiplexed  bus  from  the 
CROM.  During  normal  program  execution,  the  user's  macro  instruction  is 
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TABLE  17.  TELEDYNE  HYBRID  MICROPROCESSOR  CHARACTERISTICS 


Organization 

Parallel  16  bit,  Microprogrammed 
General  Register/Accumulator 

Registers 

4 General  Purpose 
4 Special  Purpose 
16  Level  Stack 

Instructions 

Basic  Set 

43 

Extended  Arithmetic 

17  Double  Precision,  Multiply, 
Divide 

Input/Output 

16  Bit  Peripheral  Data  In  Bus 
16  Bit  Memory  Data  In  Bus 
16  Bit  Data  Out  Bus 
16  Bit  Address  Out  Bus 
2 Interrupt  Lines 
4 Jump  Condition  Flag  Inputs 

Instruction  Times 

Reg/Reg  Add 

4.  6 iis 

Input/Output 

10  iis 

Power  Requirements 

+5  V dc  at  0. 6A 
-12  V dc  at  0. 2 A 

Price  (50-500  units) 


$1295.00 


JUMP  CONDITION 


Figure  27.  TDY-52B  Block  Diagram 


fetched  from  external  memory.  The  instruction  is  decoded,  directing  the 
control  sequence  to  an  entry  point  in  the  microprogram.  When  the  micro- 
program execution  is  complete,  the  next  macro  instruction  is  fetched. 

Macro  program  execution  can  be  suspended  if  an  interrupt  condition  exists. 
Microprogram  paths  are  affected  by  the  RALU  status  flags,  and  user  sup- 
plied jump  conditions. 

The  chip  types  contained  in  the  TDY-52B  are  listed  in  Table 
18.  One  impressive  observation  is  the  large  number  of  SSI  support  chips 
required.  This  result  is  typical  for  other  16  bit  systems  configured  from 
competing  4 bit  slices.  Smaller  systems  could  be  configured  at  the  expense 
of  some  of  the  powerful  features  that  the  processor  has.  The  processor  con- 
figuration (the  number  of  SSI  parts  and  package  pins)  is  determined  by  the 
application  requirements. 

The  T'eledyne  TDY-52B  is  an  example  of  the  industry  wide 
capability  in  multichip  hybrid  packaging.  The  capability  can  be  extended  to 
any  chip  technology  with  the  appropriate  design  constraints. 

The  state-of-the-art  in  hybrid  technology  and  device  process- 
ing is  exemplified  by  the  Trident  Program.  A blend  of  technologies  which 
have  not  been  combined  before  are  being  used  to  fabricate  hybrid  circuits  for 
use  in  the  command  sequencer,  interlocks  and  flight  control  electronics.  The 
three  technologies,  beam  lead  sealed  junction,  dielectric  isolation  anu  low 
power  Schottky  TTL,  produce  low  yield  military  devices  when  used  singularly, 
and  result  in  lower  yields  when  combined.  Although  some  circuit  and  proc- 
ess improvements  have  been  made,  some  substitution  of  junction  isolated  cir- 
cuits will  probably  be  made  for  the  production  program. 
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TABLE  18.  TDY-52B  CHIP  TYPES 


Type 

Quantity 

Function 

Integration 

Pins 

74LS00 

2 

Quad  NAND 

SSI 

14 

74LS02 

1 

Quad  NOR 

SSI 

14 

74LS04 

1 

Hex  Inverter 

SSI 

14 

74LS08 

8 

Quad  Buffer 

SSI 

14 

74LS11 

1 

Triple  NAND 

SSI 

14 

74LS32 

1 

Quad  OR 

SSI 

14 

74LS54 

1 

AND  - OR- IN  V 

SSI 

13 

74LS74 

5 

Dual  D F/F 

SSI 

14 

MH0026 

2 

MOS  Drvrs. 

SSI 

6 

8095 

1 

Hex  Buffer 

SSI 

16 

74195 

1 

4 Bit  Reg. 

MSI 

16 

8121 

2 

8 Chan  Mux 

MSI 

16 

85L51 

4 

Quad  D F/F 

MSI 

16 

9322 

1 

Quad  Mux. 

MSI 

16 

9334 

2 

8 Bit  Reg. 

MSI 

16 

81L23 

4 

Quad  Mux. 

MSI 

16 

MM5750 

4 

RALU 

LSI 

24 

MM5751 

2 

CROM 

LSI 

24 

43 

Integration: 

Interconnections: 

14%  LSI 

575  IC  Pads  x 2 = 

1150 

33%  MSI 

120  Pkg  Pins  x 2 = 

240 

53%  SSI 

Total 

1390 

r 
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SECTION  IV 

ARCHITECTURE  STUDIES 
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A.  MICROPROCESSOR  STATE-OF-THE-ART 

In  1971  Intel  introduced  one  of  the  first  single- chip  microcomputers  - 
the  4004.  The  initial  devices  were  an  outgrowth  of  the  commercial  calculator 
developments  and  were  generally  four- bit  devices  with  decimal  arithmetic 
capability.  The  second  generation  devices  featured  a longer  eight- bit  word 
and  the  technology  changed  from  the  slower  p- channel  to  the  faster  n- channel 
MOS  process.  Present  generation  devices  show  evolution  along  two  archi- 
tectural lines.  More  computing  power  can  be  achieved  either  by  building 
denser,  single  chip  Central  Processing  Units  (CPU)  or  by  structuring  micro- 
computers out  of  multiple  byte  slice  elements.  Single  chip,  16-bit  devices, 
such  as  the  National  PACE,  General  Instrument  CP- 1600,  and  General 
Automation  LSI- 16,  have  been  fabricated,  showing  one  trend  in  the  direction 
of  single-chip,  stand-alone  microcomputer  systems.  The  other  approach 
offers  two-bit  and  four- bit  byte  slice  elements  from  National,  Intel,  Mono- 
lithic Memories  and  Texas  Instruments.  The  slice  devices  are  predomi- 
nantly bipolar  technologies  (Schott  ky  and  Integrated  Injection  Logic)  for 
speed  reasons,  while  the  MOS  processes  are  retained  for  high  density  in  the 
16- bit  processors. 

Of  the  two  choices,  the  byte  slice  architecture  seems  to  be  more  com- 
patible with  military  mission  objectives.  It  provides  a wide  temperature 
range  of  operation  and  meets  the  desired  computing  requirements  of  a broad 
spectrum  of  applications.  It  also  provides  a means  of  building  a range  of 
microcomputer  architectural  structures.  The  byte  slice  architecture  offers 
the  designer  a standard  building  block  which  allows  him  to  tailor  the  machine 
organization  to  his  own  needs. 

The  characteristics  of  mid-1975  microprocessors  are  given  in  Table 
19.  The  characteristics  are  loosely  grouped  into  three  areas:  credibility- 
risk,  hardware  design  and  software  design.  In  addition  to  the  tabular  data, 
the  following  paragraphs  discuss  several  of  the  microprocessors  in  detail. 


TABLE  19.  Mi: RC PROCESSOR  CHARACTERISTICS 


1.  Intel  8080 

a.  Hardware  Considerations 

The  Intel  8080  is  an  eight-bit  parallel  CPU  packaged  in  a 40- 

• n.  ,qi  in  t ine  Package  (DIP).  It  requires  multiple  power  supplies  (±  V, 
,in  Dual  In-Line  Package  uj  ).  h rmal  operation  the  chip  dis- 

I-12V)  and  a two-phase,  high-level  clock,  m £ AMD  and  TI.  A 

sipates  1. 3 W.  The  NMOS  device  is  also  being  suppled  oy 

alock  diagram  of  the  chip  is  shown  in  Figure  28. 


Figure  28.  Intel  8080  Block  Diagram 
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stack- referencing  call  instructions  are  slightly  longer. 
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The  8080  operates  at  a clock  frequency  of  2 MHz.  The  time 
required  for  a register-to-register  add  is  2 /is.  The  register  load  time  for 
an  instruction  is  2.5  /is  and  the  register-to-memory  add  time  is  3.5  /is. 

b.  Hardware  Support 

Intel  provides  an  extensive  spectrum  of  possibilities  from  chips  to 
chip  sets  to  boards  up  to  minicomputer  (Intellec  8 MOD  80)  consoles. 

c.  Software 

The  data  and  instruction  word  size  is  eight  bits  in  length  with  16- 
bits  for  addressing.  By  design  there  is  major  software  compatibility  between 
the  8080  and  8008.  The  8080  has  a total  of  78  instructions.  The  8080  permits 
any  of  the  three  main  registers  to  hold  and  output  an  address  when  using 
register-indirect  addressing.  Multiple  indexing  can  be  accomplished  with 
these,  but  it  takes  additional  steps  compared  to  the  classical  index  register 
concept.  The  push  down  stack  can  be  used  to  hold  status  bits  and  important 
register  contents  for  interrupt  servicing.  The  data  manipulation  instructions 
include  arithmetic,  logic,  BCD  arithmetic,  and  double  precision  operations. 
All  general  purpose  registers  can  be  incremented  or  decremented.  The  use 
of  the  stack  pointer  (SP)  allows  unlimited  subroutine  nesting. 

d.  Software  Support 

Intel  has  made  a heavy  investment  in  developing  a high  level  lan- 
guage for  this  machine  (PL/M).  Macro  and  resident  assemblers  are  avail- 
able from  Intel  and  General  Electric.  Intel  also  provides  an  instruction 
simulator  and  cross  assembler. 

e.  System  Considerations 

Bus  controls  allow  for  Direct  Memory  Access  (DMA).  The  8080 
has  multiple  interrupt  capability  which  is  software  enabled  or  disabled. 
However,  it  cannot  be  microprogrammed.  The  Input/Output  (I/O)  is  accom- 
plished on  the  same  tristate  data  bus  and  is  limited  to  256  ports. 

f.  Comments 

The  8080  has  some  potential  disadvantages: 

(1)  The  lack  of  indexed  addressing  is  serious  in  some  applications. 

(2)  The  8080  needs  three  power  supplies. 

(3)  There  is  no  2's  complement  arithmetic  for  signed  numbers. 
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However,  the  8080  is  a machine  which  has  set  the  standard  for 
second- generation  performance  in  eight-bit  microprocessors.  The  PL/M 
language  developed  by  Intel  is  the  industry  trend  to  support  software 
standardization. 

2.  Motorola  6800 

a.  Hardware  Considerations 

The  Motorola  6800  is  an  integrated  chip  set  with  some  special 
parts  for  input/output.  The  eight-bit  CPU  is  packaged  in  a 40-pin  DIP.  It 
requires  a single  +5  V supply  with  a two- phase,  5 V clock.  Normal  operation 
requires  a maximum  of  1.2  W of  power.  Figure  29  is  a block  diagram. 


Figure  29.  Motorola  6800  Block  Diagram 


The  CPU  itself  is  organized  as  a conventional  computer  with  an 
eight-bit  data/instruction  path.  Instructions  execute  in  2 to  12  /is.  There 
are  two  accumulators  and  a 16-bit  index  register.  Return  addresses  are 
stored  in  RAM  and  that  stack  is  referenced  via  a pointer  in  the  CPU.  There 
are  16  bits  of  address  available;  however,  abbreviated  addresses  of  eight 
bits  can  be  directed  to  refer  to  the  first  256  bytes  of  main  memory. 
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The  6800  operates  with  a 1MHz  two-phase  clock.  A register  to 
register  add  is  accomplished  in  2 /is.  A register  to  memory  add  time  is  4 
to  6 iis  depending  on  address  modification. 

b.  Hardware  Support 

Motorola  provides  special  LSI  integrated  circuits  to  interface 
with  the  CPU  chip.  Specifically  they  provide  a 1024  x 8 bit  Read  Only  Memory 
(ROM),  a 128  x 8 bit  RAM,  a bi-directional  Peripheral  Interface  Adapter 
(PIA),  and  an  Asynchronous  Communications  Interface  Adapter.  Also  various 
levels  of  printed  circuit  board  and  console  aids  are  being  developed. 

c.  Software 

The  data  and  instruction  word  size  is  eight  bits  in  length  with 
a 16- bit  address.  There  are  72  instructions  available.  Programming  the 
6800  is  relatively  easy,  but  there  are  disadvantages.  There  are,  for  example, 
no  direct  paths  from  the  two  accumulators  to  the  index  register.  To  compute 
an  index  requires  storing  the  16-bit  value  into  RAM  and  then  reading  it  into 
the  index  register.  The  choices  of  various  addressing  modes  are  somewhat 
arbitrary  because  only  certain  instructions  admit  certain  modes;  it  is  the 
programmer's  responsibility  to  remember  which  addressing  forms  are  valid 
for  each  instruction.  The  use  of  the  stack  pointer  allows  almost  unlimited 
subroutine  nesting.  The  instructions  are  arithmetic  and  logical  which  take 
advantage  of  the  two  accumulators.  The  6800  has  instructions  for  storing  the 
status  register.  Relative  addressing  allows  data  relocation.  The  programmer 
can  reach  the  first  256  locations  of  memory  with  "short"  instructions  which 
will  allow  most  programs  to  refer  to  these  locations  for  common  work  space. 
The  6800  instruction  set  was  copied  as  closely  as  possible  from  the  PDP-11 
with  a shorter  word  length. 

d.  Software  Support 

Motorola  is  developing  a cross-assembler,  interactive  simu- 
lator and  a PL/M  type  high-level  language. 

e.  System  Considerations 

The  6800  provides  for  maskable  interrupts  at  four  levels.  All 
the  I/O  devices  appear  at  one  interrupt  level  so  that  once  an  interrupt  is  de- 
tected, polling  must  be  used  to  isolate  the  device  requiring  attention.  When 
an  interrupt  occurs,  the  processor  places  status  and  register  contents  into  the 
push  down  stack  automatically.  The  interrupt  return  instruction  refreshes 
the  state  of  the  machine  from  the  stack.  A software  equivalent  of  the  inter- 
rupt is  provided  too;  the  computer’s  state  is  pushed  onto  the  stack,  and  an 
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interrupt  service  routine  is  invoked.  DMA  is  provided  through  a Three-State 
Control  Signal.  The  6800  is  a microprogrammed  machine  but  is  not  micro- 
programmable  by  the  user. 

f.  Comments 

The  6800  is  a compromise  of  the  powerful  PDP-11  and  an 
eight-bit  machine.  It  has  a good  instruction  set  with  excellent  conditional 
branching.  The  major  disadvantages  are  its  slow  speed  and  lack  of  software 
support. 

3.  General  Electric  CRD- 8 

a.  Hardware  Considerations 

The  CRD-8  is  an  eight- bit  parallel  processor  unit  currently 
being  produced  by  General  Electric  on  a two-sided  printed  circuit  board  ap- 
proximately 4x8  inches.  This  card  contains  approximately  60  standard  T2L 
devices  including  IK  of  Program  RAM.  Typical  power  dissipation  is  18  to  20  W. 
Figure  30  is  a block  diagram. 
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Figure  30.  GE  CRD-8  Block  Diagram 
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The  CRD-8  CPU  operates  as  a signed  2’s  complement  machine. 
Data  access  to  the  machine  is  page  oriented.  For  many  instruction  types  an 
eight-bit  page  register  located  in  the  RAM  register  bank  provides  the  upper 
eight  bits  of  the  16- bit  address  specification.  In  addition  to  the  eight- bit  page 
register,  a 16- bit  program  counter  (PC)  and  program  counter  save  (PCS)  are 
located  in  the  RAM  register  bank.  The  program  counter  save  contains  the 
return  address  from  subroutine  calls.  The  PC,  PCS  and  PAGE  registers 
have  duplicate  counterparts  for  interrupt  programs.  The  interrupt  structure 
is  such  that  when  an  interrupt  is  processed,  thelPC,  IPCS  and  IPAGE  regis- 
ters are  accessed  instead  of  the  PC,  PCS  and  PAGE.  Return  from  interrupt 
reverses  this  register  swap.  The  remainder  of  the  RAM  REG  Bank  contains 
three  16- bit  registers. 

The  CRD-8  operates  at  a clock  frequency  of  4 MHz.  Typical 
instruction  time  is  approximately  2 /is,  based  on  a memory  access  time  of  100  ns. 

b.  Hardware  Support 

GE  offers  a standard  I/O  board  which  provides  proper  I/O 
multiplexing  on  the  data  bus.  Decoding  logic  for  Teletype  I/O  is  included.  A 
standard  maintenance  panel  is  also  available  providing  an  operational  proces- 
sor unit  that  contains  the  CPU  board,  I/O  board,  and  memory  cards.  A 
power  supply  package  for  a 1K-8K  RAM  processor  is  also  available. 

c.  Software 

The  data  and  instruction  words  are  eight  bits  in  length,  while 
the  address  bus  is  16  bits  wide.  The  lower  half  of  the  three  RAM  BANK 
registers  can  be  used  in  arithmetic  operations  while  the  full  16  bits  of  each 
register  can  be  used  as  memory  addresses.  This  provides  indirect  address- 
ing with  the  added  options  of  register  post  increment  (after  indirect  usage) 
or  register  predecrement  (decrement  register  prior  to  indirect  usage).  All 
incrementing  and  decrementing  is  handled  in  the  eight-bit  ALU  which  causes 
"roll- over"  at  all  page  boundaries  in  those  applications  where  16- bit  ad- 
dresses are  incremented  beyond  a page  boundary.  The  instruction  repertoire 
contains  extensive  branching  capabilities  on  the  various  condition  flags 
(overflow,  carry,  zero).  Expanded  I/O  capability  is  available  as  a hardware 
option  (Phantom  I/O)  providing  a full  page  (256)  of  I/'O  ports  which  are  ac- 
cessed via  memory  read/write  instructions. 

d.  Software  Support 

GE  has  developed  a full  set  of  software  support  for  the  CRD-8 
including  an  Assembler,  Compiler  (8-BOL,  similar  in  structure  to  PL/M), 
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Macro  Simulator,  Microcode  Simulator,  and  CRD- 8 Software  for  program 
loading  and  debug. 

e.  System  Consideration 

Extended  temperature  range  operation  as  well  as  other  im- 
proved environmental  specifications  are  possible  by  utilizing  5400  series 
T^L  in  place  of  the  commercial  grade  components.  New  instructions  can  be 
implemented  easily  by  reprogramming  the  control  ROMS.  This  is  not  pos- 
sible with  other  chip  type  processors  which  have  the  control  ROMS  on  chip. 

f.  Comments 

The  CRD- 8 processor  is  unique  in  that  it  is  a processor  design 
implemented  in  standard  T*L  rather  than  a special  LSI  chip,  with  associated 
peripheral  logic.  There  are  distinct  advantages  and  disadvantages  to  such  an 
approach: 


Advantages: 

(1)  Modifications  to  the  CPU  architecture  can  easily  be 
implemented. 

(2)  A Mil-Spec  processor  capability  with  a minimum  of 
redesign. 

Disadvantages: 

(1)  Higher  cost  and  power  consumption  than  an  LSI 
equivalent. 

(2)  Larger  in  size  than  CPU  chip-type  processors. 

General  Electric  has  an  agreement  with  Solid  State  Scientific 
in  which  S^  will  implement  the  GE  design  using  CMOS/SOS.  Samples  should 
be  available  in  late  1975. 

4.  Intersil  IM  6100 

a.  Hardware  Considerations 

The  IM  6100  is  the  first  microprocessor  incorporating  a 12-bit 
CPU  on  one  chip.  The  CMOS  chip  requires  a +3  to  +7V  supply  voltage.  All 
signals  are  TTL  compatible.  Intersil  boasts  a low  power  of  10  mW  for  typical 
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operations.  All  that  is  needed  for  a clock  is  a crystal  (rest  of  needed  clock 
circuitry  is  internal  to  the  chip).  Figure  31  is  a block  diagram. 


Figure  31.  Intersil  IM  6100  Block  Diagram 


The  IM  6100  architecture  emulates  the  Digital  Equipment 
Corporation  PDP-8.  The  CPU  has  six  12-bit  registers:  Accumulator, 
Multiplier- Quotient,  Program  Counter,  Memory  Address,  Instruction  and  a 
Temporary  Register.  Operations  are  decoded  in  a PLA.  The  CPU  has 
direct,  indirect  and  autoindexed  memory  addressing.  The  IM  6100  recognizes 
all  of  the  PDP-8  software.  It  can  handle  64  I/O  devices  and  4K  of  memory 
directly  (32K  with  extended  memory  control  hardware).  It  has  DMA  capability 
and  can  handle  any  type  or  speed  semiconductor  memory  in  any  combination. 

In  short,  the  IM  6100  is  a PDP-8  miniaturized  in  a 40-pin  package.  A 12-bit 
memory  to  accumulator  ADD  is  accomplished  in  5 fis. 

b.  Hardware  Support 

No  hardware  support  is  supplied  by  Intersil  with  the  exception 
of  semiconductor  memories.  However,  any  support  DEC  produces  for  the 
PDP-8  is  compatible  with  the  IM  6100. 
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c . Software 

The  IM  6100  has  over  40  instructions.  It  recognizes  the  in- 
struction set  of  the  PDP-8  including  AND,  TAD,  ISZ,  DC  A,  JMS,  JMP, 

JOT  and  OPR.  One  notable  omission  is  a subtract  operation.  This  must  be 
done  using  2 ’s  complement  and  adding.  Conditional  branching  can  only  be 
done  on  zero  accumulator,  non-zero  accumulator,  negative  accumulator  and 
positive  accumulator  values. 

d.  Software  Support 

Since  the  IM  6100  is  software  compatible  with  the  PDP-8,  any 
support  software  the  PDP-8  has  can  be  used  for  the  IM  6100.  The  PDP-8  is 
a mature  machine  and  has  a large  amount  of  support  software  available. 

There  is  some  type  of  legal  agreement  for  software  between  Intersil  and 
Digital  Equipment  Corporation. 

e.  System  Considerations 

The  IM  6100  and  the  PDP-8  are  software  compatible.  The  dif- 
ferences in  the  hardware  interfaces  are  described  below: 

(1)  IM  6100  handles  up  to  4K  12-bit  words  of  memory. 

(2)  The  Extended  Arithmetic  Element  option  of  the  PDP-8  can- 
not be  used  with  the  IM  6100. 

(3)  DMA  structures  are  different. 

(4)  The  IM  6100  control  panel  is  treated  as  a programmed  I/O. 

(5)  The  IM  6100  may  have  a completely  "transparent”  boot- 
strap loader. 

f.  Comments 

The  IM  6100  is  a PDP-8  with  the  above  exceptions.  One  prob- 
lem with  the  IM  6100  is  that  in  the  PDP-8,  a subroutine  call  modifies  the  first 
word  of  the  subroutine ; the  return  address  is  stored  there.  This  will  not 
work  if  subroutines  are  stored  in  ROM,  as  in  many  microprocessor  applica- 
tions. The  lack  of  a subtract  instruction  is  a minor  difficulty  which  can  be 
solved  with  more  memory  and  slower  execution. 
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5.  Intel  3000 

a.  Hardware  Considerations 

The  3000  family  is  a microprogrammable  processor  imple- 
mented in  two-bit  slices.  The  3002  is  a two-bit  slice  while  the  3001  is  a 
microprogram  control  unit.  The  two  devices  require  a single  +5  V supply 
with  0. 750  W for  the  3002  and  0.  900  W for  the  3001.  Typical  room  tempera- 
ture speeds  of  160  ns  per  microinstruction  can  be  achieved.  Figure  32  is  a 
block  diagram. 
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Figure  32 . Intel  3000  Block  Diagram 


The  architecture  and  microinstruction  set  have  provisions  for 
adding  additional  hardware  to  expand  its  capabilities.  The  microprogram  is 
stored  in  conventional  bipolar  ROM's  and  pipeline  registers  can  be  added  to 
increase  speed.  The  40  pin  3001  generates  the  ROM  address  sequences  by 
analyzing  the  current  microinstruction,  status  flags  and  up  to  eight  bits  of 
the  macro  instruction  fetched  from  main  storage.  Extensive  control  logic  in- 
cludes four-way  and  sixteen-way  jumps  that  are  steered  by  the  microinstruc- 
tion bits.  The  MCU  stores  the  ALU  flag  bits  and  selectively  re-inserts  them 
on  succeeding  cycles  to  implement  some  functions. 
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The  MCU  has  no  provisions  for  saving  or  restoring  the  cur- 
rent microinstruction  address.  This  weakness  eliminates  microlevel  sub- 
routines which  are  a serious  inconvenience.  Attempts  to  save  the  address 
externally  will  be  hampered  by  the  fact  that  only  eight  of  the  nine  address 
bits  can  be  loaded  back  into  the  chip.  There  are  ways  to  implement  micro- 
subroutines with  external  hardware  but  they  all  appear  to  be  difficult. 

The  28-pin,  two-bit  slice  allows  both  ripple  and  look-ahead 
carry;  however,  a special  look-ahead  carry  generator  is  required.  The  3002 
has  only  a carry  out  flag.  Sign  and  zero  flags  can  be  generated  with  external 
hardware  in  the  same  microcycle.  The  overflow  flag  cannot  be  generated 
within  the  same  microcycle.  This  is  a serious  problem  in  implementing  a 
minicomputer.  The  3002  has  10  general  registers  (no  read/write  capability 
in  the  same  cycle)  but  does  not  have  a Q register.  The  3002  has  a K bus 
which  supplies  bit  masking  capability. 

b.  Hardware  Support 

Intel  provides  a 16- bit  minicomputer  kit  for  $750  which  pro- 
vides most  of  the  essential  components  for  the  computer.  They  also  have  a 
line  of  computer  oriented  components  (priority  interrupt  control,  registers, 
etc. ) which  will  supplement  their  computer  products.  Intel  has  no  plans  to 
introduce  a general  purpose  macro  instruction  set  (like  8080)  and  thus  has 
no  plans  for  a front  panel,  rack,  boards,  etc. 

c.  Software 

The  instruction  set  is  developed  by  the  user  for  a given  archi- 
tecture. The  slice  responds  to  over  40  useful  functions  - arithmetic,  logic, 
shifting,  incrementing,  decrementing,  bit/byte  manipulators  are  achieved  on 
the  3002. 

d.  Software  Support 

Intel  provides  a symbolic  microassembler  for  development  use. 

e.  System  Considerations 

Interrupts,  I/O,  Addressing,  DMA,  etc.,  would  have  to  be  de- 
veloped by  the  user. 

f.  Comments 

The  inconvenience  of  the  slice  is  that  it  is  only  two  bits  (more 
chips  required  per  word  length  than  a four-bit  slice)  and  it  does  not  generate 
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all  flags  needed  for  minicomputer  operation.  Not  having  a Q register  or 
read/write  capability  into  the  general  registers  in  the  same  microcycle  re- 
sults in  a slow  multiply/divide  operation.  The  3001  MCU  also  has  problems 
in  generating  addresses  (diagonal  transportation  in  both  memory  page  and 
location  is  impossible  in  one  microcycle).  Another  disadvantage  is  the 
microlevel  subroutine  weakness.  These  disadvantages  add  up  and  produce  a 
relatively  slow  (compared  to  microinstruction  speed  capability)  minicomputer. 
However,  if  the  goal  is  not  a minicomputer  but  a data  controller,  the  bit /byte 
manipulations  produce  a very  desirable  machine. 

6.  Monolithic  Memories  5701 
a.  Hardware  Considerations 

The  MMI  5701  is  a Register/ALU  four-bit  slice.  It  contains  no 
circuitry  required  for  instruction  fetching  and  decoding.  The  slice  contains 
an  arithmetic  logic  unit  (like  the  74181),  a 16  register  file,  shifting  capabili- 
ties and  a Q register  used  to  extend  the  accumulator  to  double  precision. 

There  are  two  four-bit  address  lines  to  select  the  register  file.  The  register 
file  has  separate  read/write  capabilities  such  that  both  functions  can  be 
achieved  in  one  microcycle.  The  maximum  cycle  time  over  the  full  tempera- 
ture range  is  250  ns.  One  +5  V supply  is  needed  with  a single-phase  clock. 
Maximum  power  required  is  1.25  W.  Figure  33  is  a block  diagram. 


Figure  33.  MMI  5701  Block  Diagram 
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Control  of  data  flow  requires  16  bits:  two  four-bit  addresses 
for  the  register  file  and  eight  instruction  bits  which  are  decoded  into  17  con- 
trol signals  on  the  chip.  There  are  also  four- bit  input  and  output  ports  for 
communicating  with  a bus  system.  The  slices  are  designed  to  be  cascaded 
to  derive  machines  of  any  word  length.  Carry  propagate  and  generate  can  be 
used  with  look-ahead  carry  generators  to  improve  the  speed  performance 
of  larger  systems. 

The  four-bit  slice  produces  four  flags  each  cycle.  They  are 
overflow,  sign,  zero  detect  and  carry.  These  flags  are  essential  for  use  in 
a minicomputer  system.  The  Q register  allows  easy  implementation  of 
multiply/divide  algorithms.  Thus,  due  to  the  Q register,  flags  and  dual 
read /write  register  modes,  the  MMI  5701  allows  a minicomputer  archi- 
tecture to  be  implemented  in  fewer  cycle  steps  than  other  available  slices. 

b.  Hardware  Support 

Monolithic  Memories  offers  an  MMI  300  microcomputer  that 
emulates  Data  General's  Nova  1200  series  line.  A wide  range  of  boards, 
chassis  and  debug  packages  are  available  from  MMI. 

c.  Software 

The  instruction  set  is  developed  by  the  user  for  a given  archi- 
tecture. The  slice  responds  to  32  microinstructions.  Arithmetic,  logic  and 
shifting  is  easily  achieved;  however,  bit /byte  testing  and  manipulation  would 
be  achieved  externally  to  the  chip. 

d.  Software  Support 

Debugging  software  is  available  for  the  MMI  300;  however, 
actual  programming  support  is  nonexistent. 

e.  System  Considerations 

Interrupts,  I/O,  Addressing,  DMA,  etc. , would  have  to  be  de- 
veloped in  the  architecture  by  the  user.  This  gives  the  user  the  flexibility  to 
design  a minicomputer  which  is  tailor-made  for  a specific  application. 

f.  Comments 

The  5701  is  one  of  the  few  military  microprocessor  devices 
available.  What  it  lacks  in  speed,  it  compensates  for  in  architecture.  It 
readily  yields  a minicomputer  architecture.  Its  only  major  disadvantage  is 
lack  of  a control  circuit  design  and  the  difficulty  in  manipulating  data  bits  and 
bytes. 
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7.  Texas  Instruments  SBP0400 


a.  Hardware  Considerations 

The  SBP0400  is  a four- bit  RALU  slice.  It  is  the  only  micro- 
processor product  that  uses  I^L  technology.  i2l  yields  the  best  density  of 
the  known  technologies.  T.I.  has  boasted  execution  times  of  350  to  530  ns, , 
however,  these  speeds  have  not  been  achieved  as  yet.  The  power  supply  is 
not  a voltage  but  a current  supply.  This  is  achieved  by  a voltage  supply  and  a 
current  limiting  resistor  in  series  with  the  slice.  The  slice  requires  a maxi- 
mum of  250  mW  of  power.  All  inputs  and  outputs  to  the  40-pin  package  are 
TTL  compatible.  A single-phase , 2 MHz  clock  is  required.  Figure  34  is 
a block  diagram  of  the  SBP0400. 


Figure  34.  SBP0400  Block  Diagram 


The  slice  has  seven  working  registers  and  a program  counter. 
The  accumulator  has  a Q register  for  double  precision  arithmetic.  Shifting 
can  be  accomplished  before  the  A or  Q register  via  a multiplexer.  The  nine- 
bit  operation  word  is  decoded  in  the  slice  by  a PLA.  The  120  bit- output  of 
this  PLA  may  be  latched  with  an  external  clock.  This  allows  the  external 
circuitry  to  address  a new  instruction  while  the  slice  performs  the  present 
operation,  This  technique  is  called  PIPELINING  and  is  used  to  achieve  a high 
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speed  capability.  The  program  counter  may  be  incremented  to  the  next 
instruction  after  the  PL  A is  latched.  The  chip  has  a problem  in  the  flags 
that  are  present.  No  overflow  flag  is  provided  and  the  sign  flag  must  be 
generated  externally. 

b.  Hardware  Suppor* 

None. 

c.  Software 

The  instruction  set  is  developed  by  the  user  for  a given  archi- 
tecture. The  slice  responds  to  459  nonredundant  operations.  Arithmetic, 
logic  and  shifting  are  easily  achieved.  Multiplication  can  be  accomplished  by 
using  the  accumulator  extension  register. 

d.  Software  Support 

None. 

e.  System  Considerations 

Interrupts,  I/O,  Addressing,  DMA,  etc. , would  have  to  be  de- 
veloped by  the  user.  This  gives  the  user  the  flexibility  to  design  a minicom- 
puter tailor-made  for  a specific  application. 

f.  Comments 

Even  though  I^L  is  not  as  fast  as  t2l,  speed  can  be  improved 
by  using  the  pipeline  register  scheme  internal  to  the  chip.  Chips  can  be 
cascaded  using  carry  look-ahead  generators.  The  Q register  and  t2l  com- 
patibility with  i2l’s  low  power  make  the  chip  very  useful.  The  lack  of  flags 
and  slower  than  t2l  speeds  are  the  only  disadvantages. 

B.  MICROCOMPUTER  FORECASTS 

Extrapolated  from  current  technology,  predictions  for  1980  would 
indicate  considerable  improvements  in  LSI  capabilities  as  shown  in  Table  20. 

Assuming  the  projections  are  correct,  for  a few  hundred  dollars,  a 
minicomputer  of  the  DEC  PDP-11  class  could  be  configured  with  the  CPU 
on  a single  chip.  Similarly,  packages  containing  either  4K  x 16  bit  RAM  or 
ROM,  an  I/O  controller  or  Peripheral  Interface  Adapter  (PIA)  can  be 
envisioned. 
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TABLE  20.  LSI  CAPABILITIES 


LSI  Parameter 

1966 

LSI 

1973/1974 

LSI 

1980  (projected 
VSLI) 

Performance 

Clock  rate  (max)  (MHz) 
Speed- power  product  (pj) 

5 

100 

30 

3 to  10 

100 
0. 1 to  1 

Complexity 

Chip  size  (max)  (mil) 

Device  density  (devices/chip) 

100 

50 

250 

5000 

500 

200,  000 

Functions 

Random-access  memory 
Random  logic 

Minicomputer  CPU  (bipolar) 

16  bits 
4 gates 

2048  bits 
500  gates 
10  chips  at 
5 MHz 

64,  000  bits 
10,  000  gates 
1 chip  at  50 
MHz 

To  illustrate  the  effect  of  these  projections,  consider  the  present  day 
LSI- 11  microcomputer,  which  can  emulate  the  PDP- 11/40  minicomputer. 
The  CPU  is  contained  in  four  NMOS  LSI  devices,  housed  in  40-pin  packages. 
The  8.  5 x 10  inch  printed  circuit  board  also  contains  a 4K  x 16  RAM,  a 16- 
bit  I/O  port,  as  well  as  interrupt,  clock,  and  power-on  reset  logic.  With 
1980  technology,  the  present  60  DIP,  85  square  inch  design  could  be  reduced 
to  4 DIP'S  and  15  square  inches. 

Looking  deeper  into  the  crystal  ball  to  the  mid-1980's,  one  might 
expect  to  have  3/4  inch  square  LSI  chip  with  up  to  1/2  x 10®  active  devices. 
This  would  make  it  possible  to  fabricate  the  CPU,  a 4K  x 16  electrically 
programmable  non-volatile  memory,  a universal  interface  adapter,  DMA 
control,  bus  control,  a clock  and  timing  circuits  onto  a single  chip.  This 
chip  could  be  a truly  universal  microcomputer  similar  to  present-day  mini- 
computers, having  high  speed,  ultra- reliability,  low  power  dissipation,  low 
cost  and  small  size,  all  contained  in  a 1 inch  x 2 inch,  40-pin  DIP  package. 
This  universal  chip  would  provide  preprocessing,  central  processing  and 
post- processing  computer  capabilities.  A combination  of  these  could  be 
structured  as  a distributed  computing  architecture  that  would  satisfy  most 
medium  performance  aircraft  and  spacecraft  computing  requirements. 


C0  MICROCOMPUTER  ARCHITECTURE  CONSIDERATIONS 

1 . Objective 

The  design  and  implementation  of  a microcomputer  system  which 
must  perform  satisfactorily  in  a hostile  radiation  environment  requires  a set 
of  hardened  functional  elements.  Besides  this  hardening  constraint,  it  is 
desirable  to  design  a computer  architecture  which  allows  varying  computa- 
tional performance.  Flexible  performance  provides  a means  of  standard- 
izing hardware  elements  on  cost  effective  LSI  designs  while  allowing  a 
reasonable  match  to  a spectrum  of  military  applications. 

The  bandwidth  of  computing  applications  for  space  and  aircraft 
missions  in  many  ways  presents  a dilemma.  Present  LSI  microcomputer 
technology  provides  limited  computing/performance  capabilities  which  de- 
mands a compromise  of  some  of  our  objectives. 

2.  Limitations 

A microcomputer  is  defined  as  a stored  program  computer  system 
in  which  the  central  processing  unit  (CPU)  consists  of  a small  number  of  LSI 
circuits.  The  only  requirement  of  this  definition  is  the  exclusive  use  of  LSI 
in  the  CPU.  This  definition  is  not  based  upon  performance  or  word  size.  A 
microcomputer  should  not  be  confused  with  a microprocessor.  The  word 
microprocessor  has  at  least  two  commonly  accepted  meanings.  It  can  either 
denote  a microcomputer  or  a computer  that  is  microprogrammed.  When 
used  in  the  latter  context,  some,  but  not  all  microcomputers  are  micro- 
processors. 

Some  microcomputer  systems  that  are  readily  avPilable  today  are 
based  on  some  form  of  MOS  LSI  technology  and  are  designed  for  nonmilitary 
uses.  The  associated  logic  speed  is  modest  compared  to  the  bipolar 
Transistor  TransistorLogic  (TTL)  devices  commonly  used  in  minicomputers 
and  large  scale  computers.  In  practice,  microcomputer  designers  are  using 
logic  speeds  that  are  low  compared  to  those  used  in  other  products.  Em- 
phasis has  been  placed  on  rather  conservative  technology.  Functionally  and 
computationally,  the  application  user  derives  modest  performance  capabilities. 

The  economics  of  circuit  production  place  a limit  on  the  maximum 
silicon  chip  area.  Microcomputer  designs  suffer  from  this  constraint  in  at 
least  two  ways.  First,  much  of  the  logic  of  a microcomputer  is  not  the  highly 
regular  form  that  yields  high  chip  density.  Therefore,  the  total  number  of 
logic  functions  available  is  limited.  Second,  the  chip  area  required  to  buffer 
signals  off-chip  is  several  times  greater  than  that  required  on-chip.  Because 
of  this,  designs  that  minimize  off- chip  connections  are  favored.  The  net  result 
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of  these  limitations  is  that  designers  are  unable  to  include  all  of  the  features 
and  power  normally  available  in  conventional  computers.  Many  more  or  less 
standard  performance  enhancement  features,  such  as  complete  interrupt  sys- 
tems, index  registers,  address  modifications  and  indirect  addressing  tend  to 
be  excluded.  Of  course,  the  equivalent  of  these  features  can  be  performed  at 
the  expense  of  extra  execution  time.  Current  microcomputer  designs  provide 
modest  speed  and  throughput  capability.  Typically,  cycle  times  are  in  micro- 
seconds with  relatively  fast  register-to-register  operations,  and  slower,  multi- 
ple cycle  operations  when  memory  accessing  is  involved.  The  performance  of 
today's  microcomputers  is  roughly  comparable  with  the  minicomputers  that 
were  available  more  than  10  years  ago. 

3.  Application  Impact 

The  basic  task  of  all  computing  machines  is  the  efficient  execution 
(in  time  and  cost)  of  the  data  management  and  computing  tasks  of  the  intended 
application.  It  is  not  surprising  to  find  order  of  magnitude  speed  differentials 
between  one  machine  and  another.  Specialized  designs  for  an  application  tend 
to  be  superior  to  the  general  purpose  machine.  In  general,  a design  choice  is 
based  upon  cost,  flexibility,  reliability,  and  growth  potential  for  new  applica- 
tions. The  general  purpose  machine  generally  is  the  user's  choice  in  such 
tradeoff  comparisons  because  of  these  factors. 

The  selection  of  a candidate  computer  system  could  start  with  a 
principle  called  an  architectural  structure  matching.  The  match  consists  of 
considering  elements  in  design  and  combining  them  in  a way  to  complement 
one  another  so  that  an  optimal  cost  effective  computer  design  may  be  achieved. 

Five  primary  structural  elements  are  subject  to  tradeoff  criteria. 
These  are:  1)  technology,  2)  algorithm,  3)  data,  4)  programming  language  and 
5)  architectural  units.  These  are  discussed  in  the  following  paragraphs. 

a.  Technology 

Technology  tends  to  dominate  computer  machine  design  thinking. 
The  basic  speed  and  functional  blocks  available  inevitably  condition  the  feasi- 
bility of  the  design.  Some  of  the  aspects  of  technology,  such  as  logic  family 
performance  and  hardness  were  discussed  in  previous  sections.  Other  aspects 
are  the  system  interconnection  circuitry,  level  of  modularity,  the  availability  of 
multiple  functional  modules  and  specialized  arithmetic  hardware  to  supplement 
primitive  computing  functional  elements.  These  factors  enable  the  utilization 
and  specialization  of  dedicated  computer  operations.  In  the  next  section,  a few 
applications  are  discussed  and  the  effects  of  special  computing  features  which 
are  important  to  their  effective  execution  are  examined. 
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b.  Algorithms 

A variety  of  digital  computer  computational  algorithms  exist 
and  are  applicable  to  a wide  class  of  military  oriented  problems.  Three  data 
processing  tasks  were  discussed  in  Section  n.  Kalman  Filtering,  Radar  Beam 
Steering  and  GPS  Vehicle  Navigation  were  selected  as  a set  of  typical  space 
and  aircraft  computing  operations.  It  was  implicitly  assumed  that  they  could 
be  performed  in  a dedicated  manner.  The  three  tasks  were  partitioned  from  a 
general  class  of  command  and  control  computing  problems.  Examination  of 
the  computational  operations  associated  with  the  three  functions  indicate  the 
need  for  some  common  instruction  capabilities.  A reasonably  sophisticated 
multiply-add  and  trigonometric  function  generation  facility  is  required  for  all 
three.  All  three  applications  would  benefit  from  fast,  efficient  vector,  matrix 
and  data  manipulation  instruction  repertoires.  Two  of  the  three  require. co- 
ordinate transformation  operations  requiring  possible  matrix  inversions,  a 
basic  multiply,  divide  and  add  computation.  Some  applications  allow  for  poly- 
nomial multiply  operations  and  look  up  tables  to  substitute  for  divisions  in 
co-ordinate  transformations.  Good  address  preparation  and  modification 
facilities  dominate  in  this  area. 

The  Beam  Steering  problem  emphasizes  the  need  for  high  speed 
data  movement  from  vehicle  interfaces  and  high  speed,  minimum  instruction 
set  requirements.  The  Kalman  Filter  and  the  GPS  navigation  problem  do 
not  require  much  data  movement  but  need  high  precision  "number  crunching" 
with  medium  performance  compared  to  the  Beam  Steering  problem.  Opti- 
mizing each  application's  needs  and  having  them  complement  one  another  leads 
to  the  acceptance  of  current  minicomputer  design  concepts.  Some  compro- 
In  ihis  area  is  necessary  and  one  approach  is  to  tailor  each  computer’s 
hardware  with  standard  building  blocks.  To  be  more  generic  and  to  encom- 
pass more  applications,  some  further  discussion  of  digital  filter  algorithm  is 
beneficial. 


Recursive  filters  implemented  by  digital  computers  are  receiv- 
ing more  acceptance  with  the  improvement  in  computer  performance  and  reduc- 
tion in  conus  Any  order  filter  can  be  implemsfllad  in  hardware  as  an  intercon- 
nection of  second  order  sections  which  are  mathematically  represented  by  the 
function: 


Y = AY 
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This  function  represents  the  basic  operations  of  a whole  set  of 
basic  transforms,  beside  having  a counterpart  in  Kalman  filtering  and  poly- 
nomial expansions  and  approximations  U the  coefficients  and  variables  are 
considered  vectors,  we  have  the  core  computation  which  the  dedicated  efficient 
microcomputer  executes. 


Examination  of  the  Euler  transformations  associated  with  ve- 
hicle navigation  indicates  the  usefulness  of  a high  speed,  simple  method  to 
generate  trigonometric  functions.  The  Cordic  algorithm  in  which  shifting 
and  adding  is  used  to  generate  most  of  these  functions  rapidly  and  simplistic- 
ally  can  be  developed  in  a dedicated  microcomputer  and  pipelined  into  the 
major  tasks  of  filtering  when  necessary.  Where  time  is  not  critical,  trigo- 
nometric functions  can  be  generated  by  the  standard  power  series  expansions 
for  the  desired  range  of  precision.  The  iteration  intervals  to  complete  re- 
quired algorithms  have  been  given  for  the  three  tasks.  From  this  informa- 
tion one  can  appreciate  that  one  more  dedicated  microcomputer  is  needed. 

The  Beam  Steering  problem  requires  the  largest  number  of  simple  compu- 
tations. Although  limited  arithmetic  precision  is  required,  the  speed  of  the 
operations  saturates  any  contemporary  microcomputer. 

c.  Data  Structure 

Some  of  the  impact  of  data  structures  has  been  implied  in  the 
discussion  of  algorithms.  For  the  second  order  filter  sections,  the  number  of 
bits  in  the  data  word  comprising  the  coefficients  is  directly  related  to  the  vari- 
able word  size.  The  number  of  multiplications  and  adds  per  function  and  per 
unit  time  affects  the  precision  obtained  in  arithmetic  processing.  A variable 
word  size  matched  to  the  application  is  most  cost  effective  for  the  application 
but  not  necessarily  for  the  hardware.  Most  problems  could  be  solved  with  a 
16  bit  double  precision  arithmetic  capability  or  24  bit  single  precision  capa- 
bility. Eight  bit  machines  generally  pay  the  penalty  of  excessive  execution 
time  and  may  not  even  be  able  to  accomplish  the  required  tasks.  Processor 
word  length  is  clearly  related  to  the  application  area,  e.g. , 12  bits  generally 
for  radar,  16  to  18  bits  for  communications  and  16  to  32  for  navigation  and 
guidance.  These  choices  affect  scaling,  speed,  frequency  of  overflow,  mem- 
ory size,  multiplier  time,  truncating,  rounding  and  overall  system  cost. 

Using  a microcomputer  to  cover  such  a range  of  requirements  requires  con- 
siderable compromise . 

Memory  structure  and  memory  data  access  are  also  areas  for 
tradeoffs.  Partitioned  memories,  separate  instruction  and  data  caches  be- 
tween program  and  arithmetic  units  and  scratch  pad  banks  acting  as  part  of  the 
Arithmetic  Logic  Units  have  their  roles  in  improving  computer  performance. 
Expandable  microcomputer  system  designs  evolving  out  of  current  minicom- 
puters are  trying  to  benefit  from  past  experience  by  incorporating  these 
features . 


Floating  point  is  a desirable  add-on  feature  for  programmers 
and  may  be  required  for  the  sophisticated  navigation  problem,  which  requires 
large  numbers  of  vector  and  scaling  operations. 
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d.  Programming  Language 

Examination  of  the  current  and  projected  microcomputers 
indicates  a minimum,  in  many  eases,  and  complete  lack,  in  other  cases,  of 
software  support.  For  the  stand  alone  microcomputer  system,  reasonable 
software  support  is  available  if  it  is  an  archetype  of  a vendor's  previously 
conceived  computer  family.  Building  your  own  architecture  and  developing 
your  own  instruction  set  through  microprogramming  at  the  most  primitive 
level  is  a tedious  time-consuming  task.  Most  microcomputer  programming 
is  constrained  to  the  assembly  language  level;  however,  only  a few  vendors  at 
present  are  providing  this  capability.  For  the  present,  higher  level  language 
programming  of  microcomputers  does  not  appear  to  be  efficient  enough  to 
warrant  consideration.  One  exception  is  the  8 BOL  language  developed  by  GE 
for  the  CRD8  microcomputer.  Its  uniqueness  and  efficiency  is  predicated  upon 
its  very  close  match  to  the  hardware  implementation.  This  probably  will  be 
the  trend  for  microcomputer  systems.  Another  approach  would  be  to  establish 
a set  of  primitives  for  special  purpose  processing  algorithms  when  the  micro- 
computer is  in  a very  dedicated  computing  environment. 

e.  Hardware  Architecture 

Four  factors  discussed  previously  are  useful  in  appraising  a 
computer  vendor's  product.  From  a generic  problem  perspective  based  on 
the  candidate  applications,  the  system  structure  should  be  modular  in  nature 
and  should  be  built  from  a set  of  microcomputers  either  in  a single  or  multi- 
ple chip  microprogrammed  form.  Each  of  the  microcomputers  should  have  a 
flexible,  variable  data  and  word  length  of  one  to  four  eight-bit  bytes,  with 
flexible  direct  and  indirect  memory  addressing  and  moderate  performance 
arithmetic  capability.  The  modular  elements  that  comprise  memory  and 
input- output  are  assumed  to  be  compatible  with  the  central  processor  and 
peripherals. 


With  these  characteristics  as  a baseline  and  with  a suitable 
microcomputer,  a higher  level  computing  organization  may  be  considered 
(using  a systems  bottom-up  approach  versus  a system  top-down  approach 
which  also  is  feasible).  Tradeoffs  to  be  made  in  the  specification  of  the 
hardware  include  specialized  memories  for  data,  microprogram,  and  macro 
program  storage,  specialized  arithmetic  units,  such  as  blister  multiply  or 
floating  point  units,  hardware  parallelism  and  instruction  pipelining,  and  the 
speed,  power  and  cost  tradeoffs  between  sequential  and  combinational  logic 
implementations . 


4.  Benchmark  Characterization 


A tradeoff  analysis  to  determine  which  microcomputer  to  use  in  a 
given  application  would  be  difficult  to  conclude,  due  to  the  proliferation  of  de- 
vices with  various  architectures  and  attributes.  A typical  approach  to  narrow 
the  choice  is  to  apply  benchmark  programs  to  the  candidates  to  determine 
processing  time  and  memory  requirements.  Typical  benchmarks  would  meas- 
ure the  time  and  memory  required  for  movement  and  manipulation  of  data, 
storing  and  retrieving  status  upon  interrupt,  searching  for  parameters  in 
memory,  monitoring  peripheral  devices,  number  crunching,  etc.  As  one 
could  guess,  the  closer  the  microcomputer  approaches  a sophisticated  mini- 
computer in  performance,  the  more  likely  choice  for  the  applications. 

One  criterion  in  the  selection  process  is  instruction  power.  As  an 
illustration,  consider  the  effects  of  adding  two  eight-bit  quantities  stored  in 
memory  and  returning  the  sum  to  memory.  The  sequence  of  instructions  to 
perform  this  operation  takes  177.5  ps  using  the  Intel  8008.  Now  suppose  this 
instruction  is  to  be  stored  in  a list.  In  a conventional  machine  which  has  an 
index  register,  a single  instruction  would  accomplish  this.  The  sequence 
takes  120  ps  for  the  8008.  As  a third  example,  suppose  the  contents  of  the 
accumulator  are  to  be  stored  in  an  address  specified  by  the  contents  of  a 
pointer.  A conventional  machine  would  execute  a single  instruction  in  one 
memory  cycle.  The  Intel  8008  could  do  this  operation  in  122,  5 ps.  A con- 
siderable time  loss  with  the  8008  is  obvious  compared  to  a conventional 
machine . 


Instruction  power  improvement  is  obviously  obtained  when  the  Intel 
8080  is  considered.  This  machine  is  similar  to  the  8008  but  there  are  several 
improvements.  Perhaps  the  most  important  is  the  execution  time.  The  im- 
provement varies  with  instructions,  but  the  8080  is  on  the  order  of  six  times 
faster  than  the  fastest  8008.  The  most  obvious  change  is  the  bus  structure. 
The  8080  has  a separate  16-bit  memory  address  bus,  which  avoids  the  time 

consuming  need  for  multiplexing  a single  bus.  The  separate  bus  also  elimi- 
nates the  need  to  latch  the  address  and  the  associated  logic.  Through  im- 
provements in  addressing,  stack  point  and  word  length  movements,  consider- 
able execution  efficiencies  are  achieved.  Trial  programming  gives  execution 
times  for  the  add  to  memory  problem  of  24  ps,  the  store  in  a list  takes  17  ps 
and  the  store  at  a pointer  takes  21.  5 ps. 

Both  the  8008  and  the  8080  are  deficient  in  their  ability  to  perform 
multiplication.  An  eight-by-eight  bit  multiplication  giving  a 16 -bit  product 
using  the  8080  takes  about  260  ps.  Since  this  time  is  not  acceptable  for  most 
number  crunching  applications,  one  solution  is  to  provide  specialized  multiply 
hardware  blisters.  The  time  can  then  be  reduced  to  approximately  20  ps 
using  efficient  instruction  coding. 
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Benchmarking  a set  of  computers  for  an  application  can  be  a tedious 
job.  A thorough,  working  knowledge  of  the  instruction  set  of  each  candidate 
is  required.  A subject  of  great  controversy,  benchmarking  has  been  called  a 
black  art  in  decision  making  by  computer  experts. 

D.  ARCHITECTURE  SPECIFICATION 

1.  General  Requirements 

Considering  the  application  analyses  conducted  during  this  study 
and  anticipating  other  present  and  future  applications,  certain  candidate  com- 
puter selection  criteria  are  evident.  To  achieve  cost  effective  utilization,  a 
hardened  microcomputer  system  should  meet  the  following  constraints: 

• operates  in  a military  environment 

• readily  available 

• low  system  cost 

• highly  flexible 

• high  speed 

• low  power 

• support  software  available 

• instruction  sophistication 

• radiation  resistant 

Rather  than  designing  new  hardware  and  support  software,  the  ap- 
proach of  using  an  existing  design  with  documented  software  is  recommended. 
The  user  can  realize  lower  costs  and  quicker  deliveries  with  this  latter  ap- 
proach. Although  hardening  constraints  might  require  modification  of  the  chip 
layout  or  electrical  design,  retaining  the  overall  hardware  architecture  will 
still  provide  considerable  benefits  to  the  user. 

The  available  devices  can  be  classified  as  either  byte  slice  or  fixed 
word  length  machines.  Sixteen  bits  is  the  current  maximum  for  the  fixed  word 
length  devices.  The  applications  which  were  examined  require  numerical 
computations  with  accuracies  of  16  to  32  bits.  In  order  to  achieve  the  longer 
word  lengths  with  the  current  technology,  a byte  slice  architecture  is  indicated. 
The  smaller,  four-bit  and  eight-bit,  fixed  word  length  machines  would  be  more 
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suitable  in  other  applications,  such  as  dedicated  controllers  command  se- 
ances and  slow  speed  interface  units.  These  devices  could  have  their 
place  in’hardened  systems.  The  bit  slice  machine  is  not  the  best  answer  for 
all  applications,  but  it  provides  the  necessary  flexibility  to  the  designer 
satisfy  longer  word  length  requirements. 

For  the  real  time,  numerical  computation  applications,  execution 

100  ns  or  less),  and  for  controllers  (approximately  10  jis  or  more). 

The  bit  slice  architectures  do  not  include  instruction  decoding  logic 

sophistication. 

The  level  of  radiation  hardness  depends  on  the  application.  The  re- 
quirementj for' various  appiications  are  shown  in  Table  21  The  ability  of  the 
indicated  technology  to  satisfy  the  requirements  is  shown  m Table  a. 
dielectrically  isolILd,  ic,  power  Schottky  TTL  tecta the 
radiation  resistant,  low  power  and  high  speed  criteria  stated  initially. 


TABLE  21.  RADIATION  HARDNESS  REQUIREMENTS 


Environment 

1 

Application 
2 3 

4 

Neutron  Fluence 
(1  Mev  Equivalent) 

1010 

io12 

1012 

n 

3 x 1014 

Work  through 
Dose  Rate  (Rad-Si/s) 

1 

- 

108 

Survival  Dose  Rate 
(Rad-Si/s) 

5 x 106 

108 

A .pH 

4 x 10 

5 x 1012 

Total  Dose  (Rad -Si) 

- 

5 x 103 

106 

106 

f 


TABLE  22.  TECHNOLOGY  HARDNESS  CAPABILITY 


Technology 

1 

Application 
2 3 

4 

Limit 

CMOS 

X 

X 

• 

7 = 

s 

To111 

CMOS/SOS 

X 

X 

7 = 

3 x 105 

TTL 

X 

X 

7 = 

s 

io10 

DI/TTL 

X 

X 

X 

X 

IIL 

X 

X 

7 - 

s 

1011 

ECL 

X 

X 

N = 

io12,  7 = io10 

EFL 

X 

X 

? 

? 

• 

7 = 

S 

? (>9  x 1010) 

2.  Architecture  Recommendation 

Characteristics  of  the  various  microprocessor  products  from  over 
31  vendors  who  manufacture  over  48  different  types  have  been  examined.  The 
products  have  been  categorized  as  to  their  manufacturing  technology,  bit  size, 
CPU/slice/hybrid  availability,  number  of  instructions,  machine  cycle  time, 
I/O  speed,  memory  capabilities  and  software  support.  This  examination/ 
evaluation  was  accomplished  under  a GE  Standard  Avionics  Microprocessor 
Development  Study  initiated  in  1974  and  is  still  an  on-going  project.  As  a re- 
sult of  this  effort  and  the  objectives  expected  to  be  obtained  by  utilization  of 
hardened  microcomputers  in  the  present  study,  the  following  is  recommended. 

A distributed  microcomputer  system  composed  of  byte  slice  struc- 
tured microcomputers,  using  standard,  basic  building  blocks  is  applicable  to 
most  of  the  tasks  at  hand  and  those  visualized  for  the  future.  This  will  allow 
off-loading  of  highly  sophisticated  large  scale  data  processing  computers  used 
in  centralized  data  management  organizations.  Throughput  will  be  enhanced, 
and  fault  tolerant  characteristics  will  be  achieved.  Sparing  of  facilities  using 
redundant,  low  cost  LSI  modules  thereby  becomes  feasible.  In  those  cases 
where  sophisticated  centralization  is  not  necessary,  a family  of  these 
standardized  microcomputers  can  be  tailored  into  functional  systems  and 
subsystems.  The  byte  slice  architecture  offers  the  designer  flexibility  in 
satisfying  various  word  length  requirements.  Radiation  hardening  require- 
ments should  be  easier  to  meet  than  the  computer  on  a chip  device.  Since 
hardening  will  probably  require  more  silicon  area,  the  chip  manufacturer 


can  retain  the  same  microcomputer  architecture  by  going  to  a smaller  slice 
in  order  to  retain  finite  yields. 

3.  Byte  Slice  Comparison 

Three  candidate  microcomputer  byte  slice  architectures  are  under 
consideration  since  they  represent  a reasonable  compromise  of  the  desired 
characteristics.  The  choice  of  one  over  the  others  rests  on  the  ability  to  develop 
a powerful  instruction  set  using  microprogramming  hardware  and  intercon- 
necting the  basic  building  block  computing  elements  effectively.  A Data 
General  Nova  Class  microcomputer  using  less  than  50  LSI  chips,  including 
4K/16-bit  memory  is  feasible  with  MMI  5701  or  the  Intel  3002.  Various 
characteristics  of  the  slice  devices  are  compared  in  Table  23. 

The  Intel  two-bit  bipolar  ALU  will  not  make  a fast  minicomputer. 

It  cannot  operate  on  two  arbitrary  registers  at  one  time,  and  just  about  all 
minicomputers  need  this  capability  to  be  as  fast  as  existing  designs  using 
Schottky  ALU's  and  register  files.  Two  register  operation  is  also  required 
for  the  memory  addressing  modes  in  most  minicomputers. 

The  Intel  3002  will  not  be  speed  effective  with  existing  1 ps  mini- 
computers and  hence,  will  not  efficiently  emulate.  The  5701  will  be  at  least 
twice  as  fast  as  Intel  at  the  macro  instruction  execution  level , with  four  bits 
per  unit  versus  two  for  Intel.  (Typical  macro  instructions  would  be  Register- 
to- Register  Operation  with  indexed  or  relative  addressing.)  MMI  power  dissi- 
pation is  33%  less  than  Intel,  and  MMI  package  area  for  four  bits  is  28% 
smaller. 


Intel's  microinstruction  cycle  is  twice  as  fast  as  MMI,  but  MMI 
typically  does  two  to  four  Intel  operations  in  one  cycle.  Since  each  instruction 
must  come  from  an  off  chip  ROM,  MMI  will  require  at  least  50%  less  ROM/ 
PROM  words  than  Intel  for  any  minicomputer  design.  Intel's  device  is  not  a 
computer  ALU  like  the  5701,  but  a controller  building  block. 

Controllers,  such  as  those  for  tape  and  disc  units,  do  not  require 
many  minicomputer  level  instructions.  They  do,  however,  require  bit  testing, 
bit  masking,  and  bit  manipulations  (such  as  set  a bit,  clear  a bit,  test  a bit, 
etc. ).  Intel  has  a separate  input  port  into  their  ALU  which  can  permit  bit 
manipulations.  The  5701  can  also  do  this,  but  the  mask  word  must  first  be 
brought  into  the  internal  RAM,  and  then  can  be  used  in  conjunction  with  the 
data  in. 


In  systems  requiring  continual  monitoring  of  various  bits  to  indicate 
peripheral  activity,  Intel's  unit  will  be  faster  than  the  5701.  Applications  in 
the  area  include  bus  controllers,  floppy  disk  and  tape  controllers,  display 
controllers,  etc. 
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TABLE  2c.  BIT  SLICE  COMPARISON 


Parameter 


Slice  Width 

Power  Dissipation  (typical) 

Internal  Registers 
Logic  Functions 

AND,  OR,  NOT,  EXCLUSIVE  NOR 
TRANSFER 
FORCE  0000,  1111 
Arithmetic 

Increment,  Decrement 
2’s  Complement 
Add 

Subtract 
Shift  Right 
Shift  Left 

Zero  Detect,  Overflow  Detect  Flags 
Accumulator  Extension 
Military  Devices 
16x16  Multiply  (Add  + Shift) 

ALU  Cycle  (worst  case) 
Register/Register 

Add 

Subtract 
Exclusive  OR 
External  ALU  Input  Ports 
External  ALU  Output  Ports 
Effective  Address  Calculation 
Typical  16  Bit  Minispeed 
(no  pipeline;  load,  store, 
register-to-register  add) 


Intel 

3002 

TI 

0400 

MMI 

5701 

2 

4 

4 

Bits 

750 

112 

1000 

mW 

13-2  bit 

10-4  bit 

17-4  bit 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

« 

No 

No 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

No 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

Yes 

No 

No 

Yes 

No 

Yes 

Yes 

No 

Yes 

Yes 

16 

20 

5 

/IS 

100 

1000 

250 

ns 

200 

2000 

250 

ns 

300 

2000 

250 

ns 

200 

2000 

250 

ns 

3 

1 

1 

2 

1 

1 

600 

3000 

250 

ns 

2.8 

12 

1.5 

lis 

Application  Area 


Controllers  Minicomputers 
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Intel  also  offers  the  3001  microprogram  control  unit  (MCU).  This 
unit  handles  the  sequence  control  section  of  the  CPU.  Intel's  MCU  is  quite 
limited  in  capability  since  it  is  controller  oriented.  The  unit  cannot  jump 
between  arbitrary  states,  and  cannot  have  microprogrammed  subroutines.  It 
does,  however,  save  20  TTL  packages. 

Intel's  MCU  is  available,  along  with  a priority  interrupt  chip  (re- 
places five  TTL  packages),  and  a bi-directional  bus  driver  (a  few  TTL  pack- 
ages). MMI's  answer  to  the  MCU  is  the  PLA  and  FPLA.  Intel's  priority 
interrupt  chip  does  not  allow  masking  of  interrupt  levels  or  dynamically 
changing  interrupt  priorities,  and  is  not  well  suited  for  interrupt  structures 
in  machines  other  than  controllers. 

Both  the  MMI  and  Intel  devices  use  bipolar  Schottky  technology, 
while  the  TI  device  uses  IIL.  The  speed-power  tradeoff  of  IIL  is  apparent  in 
the  comparison  with  the  Schottky  devices. 

Architectural  deficiencies  in  the  Intel  device  include  the  lack  of 
a subtract  instruction,  no  accumulator  extension  and  the  lack  of  external 
flags.  The  latter  characteristic  also  applies  to  the  TI  device.  These  charac- 
teristics are  important  for  fast  execution,  microprogram  control,  and  effic- 
ient multiply /divide  operations.  Of  the  types  considered  in  this  study,  the 
5701  device  has  the  best  characteristics  for  the  minicomputer  application. 

The  Intel  device  will  provide  superior  performance  in  controller  applica- 
tions. The  TI  slice  is  best  suited  in  low  speed,  low  power  applications. 

E.  FAULT  TOLERANCE 

Providing  the  hardware  and  software  necessary  to  achieve  fault  tolerant 
computer  operations  in  a microcomputer  system  requires  the  utilization  of 
many  reliability  enhancement  techniques.  Approaches  which  are  applicable  to 
fault  tolerant  design  include  intrinsic  reliability  of  components,  built-in-test- 
equipment  (BITE),  redundancy  techniques,  replacement  and  sparing,  and  fault 
masking.  Any  or  all  of  these  techniques  may  be  augmented  with  software  sup- 
port. The  techniques  which  are  utilized  in  an  application  must  be  cost  effective 
with  respect  to  the  criticality  of  the  mission.  Most  vendors  of  computer  prod- 
ucts consider  reliability  enhancement  of  their  designs  as  add-on  features  and  do 
not  incorporate  fault  tolerant  characteristics  at  the  lowest  levels.  The  com- 
puter system  architect  attempts  to  achieve  system  fault  tolerance  at  high 
modular  levels.  The  higher  ihe  level  in  general,  the  more  feasible  the  fault 
tolerant  design  and  the  more  cost  effective.  With  present  technology,  sparing 
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and  replacement  techniques  are  sufficient  to  give  reliable  fault  free  com- 
puter system  performance  over  several  years.  A few  contemporary  LSI 
designs  are  purported  to  provide  better  than  100,000  hour  mean-time- 
between-failures  (MTBF)„  Several  of  these  could  achieve  95%  reliability 
for  as  much  as  a five  year  period  with  adequate  switching  and  replacement 
procedures.  Duplicating  and  triplicating  functions  for  error  detection,  iso- 
lation and  correction  must  be  justified  on  their  own  right  as  to  their  cost 
effectiveness  in  each  application.  Fault  masking  at  the  chip  level  is  still  at 
an  experimental  research  level  and  requires  considerable  investigation  be- 
fore it  can  be  suggested  or  utilized. 

Up  to  this  point,  circumvention  techniques,  part  of  a fault  tolerant 
hardened  methodology,  have  not  been  discussed.  Along  with  a selected 
architecture  using  a semiconductor  technology,  a magnetic  memory  tech- 
nology may  be  necessary.  Plated  wire  memory  is  available  to  provide  fault 
tolerant  backup.  The  safe- store  of  critical  information  may  be  essential  to 
survive  a temporary  system  power-down  caused  by  incident  radiation.  A mag- 
netic memory  safeguard  is  necessary  for  critical  programs,  maintaining  sys- 
tem status  and  for  interim  processing  of  critical  tasks.  A nonvolatile  mem- 
ory which  stores  bootstrap,  and  possibly  roll  back  and  recovery  programs  is 
essential  to  circumvention.  Fault  tolerant  techniques  must  be  designed  into 
a system  to  achieve  system  recovery  and  survivability  from  transient 
failures  and  catastrophic  stoppages.  Means  of  detecting  radiation  effects 
in  excess  of  intrinsic  device  tolerance  are  required  to  provide  for  timely 
enactment  of  circumventing  procedures.  Restart  and  recovery  procedures 
which  are  part  of  the  firmware  and  the  nonvolatile  fail  safe  stored  software 
programs  require  tailor  made  instructions.  The  so  called  Program  Triple 
Store  instructions  in  which  critical  status  data  is  stored  in  triplicate  is  a 
practical  fault  masking  technique.  Adequate  hardware  interrupt  facilities 
provide  a means  of  achieving  prompt  recoveries  from  failures.  For  immi- 
nent failures  of  a permanent  nature,  good  hardware  interrupt  facilities  pro- 
vide a means  of  alerting  the  system  promptly  of  the  current  failure  so  that  a 
spare  replacement  procedure  can  be  instituted.  When  spares  are  not  avail- 
able, some  form  of  graceful  degradation  may  be  achieved  by  alerting  another 
computer  in  the  system,  if  one  is  available,  to  take  on  new  tasks  formerly 
assigned  to  the  failed  unit  if  that  computer  is  not  fully  loaded;  or  if  it  is,  to 
give  up  some  nonessential  tasks  it  is  dedicated  to,  in  order  to  permit  it  to  do 
so.  Graceful  degradation  techniques  are  highly  dependent  upon  the  multicom- 
puting-multiprocessing computer  organization  configurations,  and  super- 
visory operating  system  structures.  In  the  following  sections  a distributed 
federated  organization  of  computers  is  discussed  in  which  fault  tolerance  is 
achieved  by  graceful  degradation  along  with  sparing  and  replacement  haidware 
strategies. 
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F.  MULTIPLE  COMPUTER  ORGANIZATIONS 

Centralized  and  distributed  multiprocessing  computer  organizations  have 
been  a subject  of  controversy  and  tradeoffs  for  many  years.  Each  organiza- 
tion has  its  advantages  and  disadvantages.  Until  the  evolution  of  the  low  cost 
LSI  microcomputer,  most  computer  users  favored  the  centralized  multiproc- 
essing organizations  because  of  proven  efficiencies.  Recent  technological  de- 
velopments have  caused  the  pendulum  of  favoritism  to  swing  the  other  way. 

The  low  cost  LSI  microcomputer  provides  a means  of  developing  a reasonable, 
cost  effective,  federated  computer  system.  A group  of  federated  microcom- 
puters in  which  each  has  a dedicated  task  requires  effective  program  and  com- 
munication path  partitioning  between  processing  elements.  Localization  and 
dedication  of  specialized  tasks  requires  a set  of  regional  hardwired  buses,  each 
of  which  is  assigned  to  the  local  microcomputer  and  its  associated  peripherals 
or  subsystems.  Control  is  assigned  to  a dedicated  central  computer,  the  func- 
tion of  which  is  to  provide  supervisory  and  data  management  control  for  the 
regional  microcomputing  activities.  Although  the  federated  system  is  recom- 
mended for  a hardened  microprocessor  feasibility  study  and  implementation, 
the  centralized  multiprocessor  organization  also  can  serve  as  a test  bed. 

Two  diagrams  of  distributed  processing  organizations  are  presented  for 
illustration  using  LSI  microprocessing  candidate  multicomputers.  Figure 
35  follows  a classical  multiprocessor  format  in  which  the  computing  elements 
are  independent  and  are  configured  to  utilize  common,  shared  resources.  The 
multiprocessor  has  spares  at  the  ALU,  I/O  and  memory  element  level.  This 
organization  is  best  implemented  centralized  since  all  elements  are  functionally 
tied  to  one  another.  Throughput  is  variable  since  active  processors  depend 
upon  computational  load.  When  not  serving  as  throughput  enhancers,  the 
modules  becomes  spares.  The  structure  is  noted  for  its  graceful  degradation 
and  reliability  enhancement  features.  A reconfiguration  switching  processor 
is  replicated  (TMR)  and  represents  the  hard  supervisory  core  to  provide 
switching  and  replacement  strategies  for  the  system  to  permit  it  to  survive. 

Figure  36  is  an  example  of  a set  of  stand  alone,  monolithic  LSI  micro- 
computers in  a system  in  which  federated  supervision  and  task  dedication  is 
emphasized.  The  sparing  and  replacement  level  is  a modular  step  above  the 
previously  discussed  multiprocessing  organization.  Control  o*  activities  is 
assigned  to  a microcomputer,  the  counterparts  of  which  are  local  microcom- 
puters. Sparing  and  replacement  of  a failed  module  is  a function  of  the  hard 
core  triple  modular  redundant  (TMR)  microprocessing  unit.  Bus  structures 
for  the  centralized  microprocessor  organization  are  somewhat  different  than 
the  federated  system,  in  that  the  centralized  system  requires  more  sophisti- 
cated multiplexing  and  sw  tching.  Both  approaches  represent  sophisticated 
system  architectures  and  utilize  LSI  computer  technologies. 
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Figure  35.  Distributed  Multicomputer  System  - Candidate  1 


Reliability  enhancement  techniques  to  achieve  computer  fault  tolerance 
are  illustrated  by  these  organizations.  In  general,  the  architecture  supports 
and  supplements  the  microcomputing  stand  alone  elements.  These  approaches 
are  adjuncts  to  other  techniques  which  include  modular  element  reliability  en- 
hancement. A qualitative  comparison  of  both  organizations  is  provided  in 
Table  24. 
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Figure  36.  Distributed  Multicomputer  System  - Candidate  2 
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G.  ARCHITECTURE  EXAMPLES 


1.  High  Speed  ECL  Processor 

Although  not  available  as  a microprocessor  in  LSI  form,  the  High 
Speed  ECL  Processor  (HSP)  is  an  example  of  the  high  speed  which  can  be 
achieved  from  a simple  architecture  and  ECL  MSI  parts.  The  speed  of  the 
processor  makes  it  suitable  for  signal  processing  applications. 

The  block  diagram  of  the  HSP  is  shown  in  Figure  37,  and  its  char- 
acteristics are  summarized  in  Table  25.  The  hardware  simplicity  of  the 
16-bit  processor  is  evident  from  the  block  diagram.  Sophisticated  features 
such  as  multiple  general  registers  or  a stack  were  not  included  to  keep  the 
design  simple  to  allow  economical  use  of  processor  modules  in  a multiproces- 
sor organization.  Basic  instructions  which  do  not  require  a memory  reference 
are  executed  in  50  ns.  The  2's  complement  multiplication  of  two  16-bit  oper- 
ands will  yield  a 32-bit  product  in  900  ns. 

Motorola  has  announced  a bit  slice  ECL  processor  which  will  be 
available  in  1976.  The  family  includes  five  devices:  a four-bit  processor 
slice,  control,  timing,  memory  interface  and  slice  look  ahead  chips.  The 
microprogrammable  processor  features  a 75  ns  cycle  time  over  temperature 
and  MECL  10,000  logic  compatibility.  When  available,  this  processor  will 
provide  a significant  addition  to  the  high  performance  end  of  the  microprocessor 
spectrum. 


TABLE  25.  HSP  CHARACTERISTICS 


Word  Length 

16  bits 

Technology 

ECL  1 OK  MSI 

Memory  Capacity 

64K 

IC's  (w/o  memory) 

92 

Power  (w/o  memory) 

32  W 

Board  Area 

100  sq.  in. 

Cycle  Time 

50  ns 

16x16  Multiply 

900  ns 
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2.  Bit  Slice  Processor 


A block  diagram  of  a processor  which  was  designed  for  navigation, 
guidance  and  control  computations  is  shown  in  Figure  38.  Its  performance 
is  summarized  in  Table  26.  The  architecture  for  this  machine  was  chosen 
to  provide  a basis  for  comparison  to  the  bit  slice  machine  (BSP)  described  in 
Figure  39  and  Table  27.  The  navigation  computer  (NGC)  contains  plated 
wire  memory  for  scratch  pad  storage,  five  general  purpose  and  two  index 
registers.  The  machine  is  microprogrammed  and  is  several  steps  above  the 
HSP  in  architectural  complexity. 

TABLE  26.  NAVIGATION  COMPUTER  PERFORMANCE 


Word  Length 
Technology 
Volume 
Weight 
Power 
Instructions 
Load/Store 
Add 

Multiply  (24  bits) 


24  bits 
TTL  MSI 
236  cu.  in. 

11  lbs 
90  W 

33  x,  ~,  sin,  cos,  tan 
2.4  iis 
3.6  (is 
50  iis 


The  general  registers  and  arithmetic  logic  unit  (ALU)  are  con- 
tained in  the  register  and  arithmetic  logic  unit  (RALU)  slices  in  the  BSP. 

The  plated  wire  memory  and  the  external  macro  instruction  memory  are  re- 
tained. The  estimated  characteristics  are  shown  in  Table  27.  The  figures 
in  Table  27  do  not  include  the  plated  wire  memory,  but  typical  numbers  are 
included  for  the  other  memory  types.  The  integrated  circuit  count  and 
power  analysis  are  shown  in  Table  28. 


TABLE  27.  BIT  SLICE  PROCESSOR  CHARACTERISTICS 

Word  Length  24  bits 

Technology  TTL  LSI/MSI 

Board  Area  115  in.^ 


Power  28  W 

IC's  65 

Instructions  64 
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Figure  38 


Navigation  Computer  Block  Diagram 


TABLE  28. 


BIT  SLICE  PROCESSOR  CIRCUIT  TYPES 


i 


Typical  Power 

Function  Type  Quantity  Pins  Each  Total 

(mW)  (mW) 


REGISTERS 


Macro  Address,  11  bits 

54175 

2 

16 

225 

450 

Instruction,  24  bits 

54175 

4 

16 

225 

900 

PW  Address,  9 bits 

54175 

2 

16 

225 

450 

PW  Data  Out,  24  bits 

54175 

4 

16 

225 

900 

PW  Data  In,  24  bits 

54175 

4 

16 

225 

900 

♦Micro  Address,  9 bits 

54175 

2 

16 

225 

450 

Subtotal 

IF 

~4FF0 

MULTIPLEXERS 

Data  In,  4:1 

54153 

12 

16 

180 

2160 

RALU  Register,  2:1 

54157 

2 

16 

150 

300 

♦Next  Address,  2:1 

54157 

1 

16 

150 

150 

♦ROM,  2:1 

54157 

3 

16 

150 

450 

Subtotal 

18 

3060 

READ  ONLY  MEMORIES 

Macro,  2K  x 24 

3304 

12 

24 

700 

8400 

♦Starting  Address,  64  x 9 

54186 

2 

24 

300 

600 

Micro,  512  x 40 

3304 

5 

24 

700 

3500 

♦Flag,  128  x 4 

54187 

1 

16 

470 

470 

Subtotal 

20 

12970 

ARITHMETIC  UNIT 

RALU,  24  bits 

5701 

6 

40 

1150 

6900 

Look-Ahead 

54182 

3 

16 

180 

540 

Subtotal 

9 

7440 

Total  IC's 

65 

Total  Power  27.5  w 

*MCU  parts 
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Roth  Drocessors  require  a large  investment  in  microprogram  hard- 
warp  from  the  macro  instruction  memory  is 

MCU  were  available  as  a 40-pin  DIP  and  dissipated  1 0 W the  IC : count 
the  BSP  would  be  reduced  to  56,  and  the  power  dissipa  i 
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FROM  IR 


next  address 

FROM  fiP  ROM 
COLUMN  ROW 


FLAGS  # 
FROM  5701  • 


TO  fiP  ROM 


Figure  40.  Microprogram  Control  Unit 
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3.  General  Purpose  Processor 


The  BSP  shown  in  Figure  39  does  not  have  extensive  input/output, 
direct  memory  access  or  interrupt  facilities,  which  could  be  added  as  shown 
in  Figure  41.  These  features  would  require  additional  hardware,  board 
space,  and  power.  An  interrupt  processor  could  provide  the  interface  be- 
tween device  requests  and  the  MCU.  The  MCU  would  provide  mask  and 
enable  inputs  to  the  interrupt  processor,  which,  in  turn,  would  provide  a 
vector  jump  address  to  the  MCU.  The  interrupt  routine  for  the  device  would 
start  at  the  jump  address  in  the  microprogram.  The  interrupt  processor 
could  contain  priority  encoders  to  allow  the  processor  to  respond  quickly  to 
the  most  critical  input  or  output  device.  The  I/O  processor  would  provide 
the  digital  and  analog  interface  to  the  peripheral  devices.  The  PIA,  such  as 
those  in  the  Motorola  6800  family,  could  be  configured  to  serve  the  digital 
input  and  output.  If  the  I/O  processor  cannot  provide  the  required  speed, 
the  DMA  processor  can  be  used.  The  configuration  shown  in  Figure  41 
allows  the  peripheral  to  communicate  with  the  data  scratch  pad,  not  the  pro- 
gram store  memory.  The  memory  access  controller  arbitrates  access  re- 
quests betweenthe  DMA  processor  and  the  RALU  processor. 

The  microprogrammable  bit  slice  devices  have  placed  LSI  slices  of 
registers  and  arithmetic  logic  units  in  the  hands  of  the  designer.  As  the 
users  make  the  current  suppliers  aware  of  their  needs  for  I/O,  DMA,  and 
interrupt  processors,  LSI  devices  suitable  for  these  applications  will  also 
appear.  In  fact,  some  devices  (PIA's,  Modems,  Universal  Receiver/ 
Transmitters)  are  presently  available  in  the  I/O  area.  Although  the  RALU 
slices  presently  become  lost  in  the  supporting  logic,  the  situation  should 
change  in  the  future. 


SECTION  V 

SUMMARY  AND  CONCLUSIONS 


The  three  application  areas  which  were  studied  are  only  a small  sample 
of  potential  areas  for  dedicated  processors.  Fixed  word  controllers  were  not 
examined.  Preprocessors  dedicated  to  one  or  more  sensors  controlling  the 
analog  to  digital  conversion,  engineering  unit  conversion,  and  data  formatting 
would  not  require  sophisticated  machines,  but  would  relieve  the  downstream 
processor  of  tedious  work. 

The  applications  which  were  examined  illustrate  the  wide  range  of  proc- 
essor requirements  which  could  be  served.  Based  on  present  technology,  the 
bit  slice  microprocessor  is  the  best  vehicle  for  a standard  part  solution  to  the 
long  word  length  application  requirements.  The  present  bit  slice  machines 
are  microprogrammable,  allowing  the  user  to  generate  his  own  instruction 
set  to  tailor  the  processor  to  his  own  application  needs.  Alternatively,  a 
standard  instruction  set  could  be  provided  with  the  ability  to  add  special  in- 
structions at  the  option  of  the  user. 

The  selection  of  a technology  for  a radiation  hardened  microprocessor 
depends  on  the  radiation  environment  and  the  performance  goals  and  their 
priorities.  Dielectrically  isolated  TTL  is  the  only  current  technology  which 
will  meet  the  most  severe  boost  and  re-entry  radiation  environments.  Several 
technologies  can  meet  the  less  severe  airborne  requirements,  so  that  the 
selection  becomes  one  based  on  performance  criteria  (speed,  power),  cost 
and  availability,  which  are  subject  to  tradeoffs  by  the  user.  The  technologies 
of  the  currently  available  unhardened  bit  slice  microprocessors  are  Schottky 
TTL  and  IIL. 

There  are  two  basic  approaches  to  hardening  electronic  system  designs. 
The  first  is  to  utilize  the  intrinsic  hardness  of  the  various  existing  technologies 
and  specify  that  technology  or  mixture  of  technologies  which  will  meet  the  re- 
quirements. The  second  is  to  use  circuit  design  and  processing  techniques  in 
the  design  and  fabrication  of  custom,  hardened  components.  The  user  may 
have  no  choice  in  accepting  the  cost,  performance  and  risk  penalties  of  the 
latter  approach  if  his  requirement  cannot  be  met  by  existing  technology. 

Hardening  requirements  usually  require  speed-power  tradeoffs  in  the 
device  design.  Device  isolation  will  require  additional  silicon  area  so  that 
there  are  fewer  devices  per  unit  area.  As  a further  consequence,  the  total 
number  of  circuit  nodes  will  increase,  causing  additional  power  dissipation 
and  slowing  down  circuit  speed.  With  lower  device  density,  the  hardened 
computer  on  a chip  will  be  available  at  a later  point  in  time  than  the  unhardened 
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device.  Until  that  point,  the  hardened  computer  will  have  to  support  an  archi- 
tecture wliich  consists  of  a collection  of  simpler,  functional  components. 

The  recommended  architecture  for  a radiation  hardened  microprocessor 
is  based  on  a bit  slice  device.  A four-bit  slice  is  preferable  over  a two-bit 
slice  for  interconnect  and  area  considerations.  TTL  technology  provides  the 
best  combination  of  speed  and  power  characteristics  with  the  potential  of  the 
highest  radiation  hardness.  The  internal  architecture  of  the  slice  should  in- 
clude multiple  general  registers,  an  accumulator  extension  register  for 
multiply  and  divide  operations,  ALU  flags  such  as  carry,  overflow,  and  zero 
for  easy  branch  condition  testing,  and  separate  output  ports  for  address  and 
data.  Microprogrammability  would  offer  the  user  extensive  flexibility,  either 
in  specifying  a complete  instruction  set  or  in  adding  special  instructions. 

- 

Several  external  LSI  chips  are  desirable  to  support  the  bit  slice  chip 
and  reduce  the  SSI/MSI  device  count.  A Microprogram  Control  Unit  (MCU) 
would  interface  between  the  macro  instruction  and  microinstruction  memories 
by  examining  the  processor  flags  and  other  information  to  determine  the  ad- 
dress of  the  next  microinstruction.  An  Interrupt  Processing  Unit  (IPU)  would 
examine  the  incoming  service  requests  and  generate  address  information  to 
the  MCU  depending  on  processor  state,  priority  and  mask  information.  Input/ 

Output  Processors  (IOP)  would  serve  as  digital  channels  between  the  I/O  de- 
vices and  the  processor  which  would  provide  the  IOP  with  commands  and 
starting  data  and  then  return  to  other  calculations.  A Memory  Access  Con- 
troller would  oversee  the  limited  memory  resource  and  grant  requests  to  the 
DMA  and  IOP  channels,  and  the  processor. 

These  chips  partition  the  microcomputer  into  functional  LSI  chips,  which  I 

could  then  be  configured  in  various  quantities  to  satisfy  specific  application 
needs.  A 16-bit  microcomputer  with  the  power  of  a present  day  minicomputer 
could  be  composed  of  eight  LSI  chips  (four  slices,  an  IOP,  MAC,  MCU  and 
IPU),  not  including  memory. 

The  attractiveness  of  microcomputers  in  terms  of  size  and  the  resulting 
flexibility  will  eventually  result  in  their  use  in  space  and  airborne  applications. 

A family  of  hardened  microprocessors  is  desirable  and  technologically  feasible, 
although  their  availability  will  be  limited,  and  their  costs  high.  A program  to 
develop  a hardened  microprocessor  will  be  least  expensive  if  the  design  is 
based  on  an  existing  commercial  product  which  has  fully  developed,  documented 
support  software. 
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